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PYRIMIDINE DERIVATIVES AND 
OLIGONUCLEOTIDES CONTAINING SAME 

BACKGROUND OF THE INVENTION 

Tbis invention relates to the field of labels, particularly 
labels for diagnostic or analytical use. In particular, it relates 
to oligonucleotides that are modified to enhance the binding 
affinity or the binding specificity of the oligonucleotides for 
complementary sequences and that in addition optionally 
bear a readily detectable characteristic. 

Sequence specific binding of oligonucleotides both to 
single stranded RNA and DNA and to duplex DNA is widely 
known. This phenomenon has been harnessed for a great 
variety of diagnostic, therapeutic and analytical, e.g., 15 
sequence determination or gene mapping, purposes. 
Previously, one objective of research in this field has been to 
increase the affinity of such oligonucleotides for their 
complementary sequences. For example, workers have 
described oligonucleotides containing .5-substituted pyrimi- 20 
dine bases that substantially increase the Tm for oligonucle- 
otide binding to complementary bases (International Publi- 
cation No. WO 93/10820). 

Publications have described the use of fluorescent 
cytosine derivatives to prepare labeled DNA probes. See 
Inoue et al., Jpn. Kokai JP 62059293, (1987). In addition, 
fluorescent labeled nucleotides have been employed in DNA 
sequencing. See Prober et al., "Science" 238:336-341 
(1987). 

30 

13-Dihydro-2H-imidazo[4,5-b]-quinolin-2-one deriva- 
tives as phosphodiesterase inhibitors are disclosed by Raey- 
mackcrs ct al: (EP 541,153). 

U.S. Pat. No. 5,502,177, discloses phenoxazine 
polycycle-containing oligonucleotides and monomeis for 35 
preparing the oligonucleotides. 
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(1) 




and tautomers, solvates and salts thereof, wherein 

R^ is a binding partner, a protecting group, a linker or 
-H; 

R^ is A(Z)yi, wherein A is a spacer and Z independently 
is a label bonding group optionally bonded to a detect- 
able label, but R^ is not amine, protected amine, nitro 
or cyano; 

R^'^ is independently — CH=^, — ^N=, — C(Ci.8 alkyl)= 
or — C(halogen)=, but no adjacent R^^ are both 
— N=, or two adjacent R^^ are taken together to form 
a ring having the structure. 




OBJECTS OF THE INVENTION 

The invention compositions or methods accomplish one 4o 
or more of the following objects. 

An object of this invention is to increase the affinity of 
oligonucleotides for their complementary sequences. 

An object of this invention is to increase the specificity of 45 
oligonucleotides for their complementary sequences. 

Another object of this invention is to provide detectable 
labels for use in diagnostic assays. 

Another object is to enhance diagnostic assays that use 
oligonucleotides. 

Another object is to improve the therapeutic efficacy of 
oligonucleotides. 

Another object is to improve the potency of oligonucle- 
otides as antisense reagents that affect gene expression by 
altering intracellular metabolism of complementary RNA 
sequences encoding a target gene(s). 

Another object is to provide chemical intermediates and 
synthesis methods to prepare the invention compositions, 

These and other objects of the invention will be apparent 
when one considers the disclosure as a whole. 

SUMMARY OF THE INVENTION 

65 

In accordance with the objects, the invention provides 
compounds having the structure (1) 



where R^ is independently — CH=, — N===, — C(Ci.8 
alkyl>= or — C(faaiogen)==, but no adjacent R* are both 
-N=; 

is , or — N(CH3)— ; and 
XI is 1, 2 or 3. 

When the binding partner R^ is an oligonucleotide, 
embodiments of the compounds of this invention include 
oligonucleotides of structure (2), (2A), (2B), or (2C) 
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-continued 

(2A) 




wherein 

D is — OH, protected — OH, an oligonucleotide coupling 
group or a solid support; 

D"^ is an oligonucleotide coupling group, — OH, protected 
— OH or a solid support, wherein is bonded to one 
2' or 3' position in the oHgonucleotide of structure (2) 
and the adjacent 2' or 3* position in structure (2) is 
substituted with R^^, provided that D and are not 
both an oligonucleotide coupling group or they are not 
both a solid support; 

is -<:0^R\ — C(0)N(R^)2, — SOsR^ — S02N(R^)2 
or an activated derivative of — CO2H or — SO3H; 

is a protecting group, — H or — (CHJ^-e — ^(R^)^; 
R'^ is independently a phosphodiester linkage or a phos- 
phodiester substitute linkage, wherein R"^ is bonded to 
one 2" or 3' position in the structure (2) oligonucleotide 



i8483 

4 

and the adjacent 2' or 3' position in structure (2) is 
substituted with R^^; 
R^ is independently — H or a protecting group; 

R^^ is independently — H, — OH, halogen or a moiety that 
^ enhances the oligonucleotide against nuclease cleav- 
age; 

R^'^ is independently — O— , — CHa— or — CF2— ; 
n is an integer from 0 to 98; and 
10 B independently is a purine or pyrimidine base or a 

protected derivative thereof, provided that at least one 

B is a base of structure (3) 

(3) 
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Embodiments include compositions useful as intermedi- 
ates in making the structure (1) compounds, including 
intermediates having structure (4) 

(4) 
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45 and tautomers, solvates and salts thereof wherein, 
R^"^ is a halogen; and 
R^^ is — SH, —OH, =S or =0, 
In a further embodiment, the invention includes contact- 
ing a structure (2), (2A), (2B), or (2C) oligonucleotide, 

50 wherein n is at least about 7, with a sample suspected to 
contain a nucleic acid having a base sequence that is at least 
substantially complementary to the structure (2), (2A), (2B), 
or (2C) oligonucleotide. 
In a further embodiment, the invention includes detectmg 

55 the presence, absence or amount of a complex comprising a 
structure (2), (2A), (2B), or (2C) oligonucleotide, wherein n 
is at least about 7, and a nucleic acid having a base sequence 
that is at least substantially complementary to the structure 
(2), (2A), (2B), or (2C) oligonucleotide. 

60 In a farther embodiment, the invention includes convert- 
ing a structure (4) compound to a compound of structure (1) 
where R^ is — 0 — or — S — and the R^ atom or moiety 
alpha to the ring containing R^'^ is — 0 — , — S — or 
--CH2— by displacing R^^. 

65 In a further embodiment, the invention includes convert- 
ing a structure (4A) compound (a structure (1) compound 
where R^ is replaced with — NHj); to a compound of 
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structure (1), by reductive alkylation of the — NH2 group to 
yield a structure (1) compound where the moiety alpha to 
the ring containing R^"^ is — NH — . 

DETAILED DESCRIPTION OF THE 
INVENTION 

The invention comprises all combinations formed by 
independently selecting individual Markush group members 
and assembling them in accordance with the teachings 
herein. The invention optionally excludes any feature or 
substance found in, or obvious over, the prior art. 

Structural formulas are designated as parenthetical 
numerals. We intend that designation of aromatidty with 
respect to carbocycles and heterocycles herein includes any 
highly resonant unsaturated ring structure. Alternatively, 
placement of double bonds, where indicated, represents one 
potential structure for the depicted compound but we intend 
this depiction to include other resonant states of the com- 
pound as well as protonated and charged species, only one 
of which a structure may show. 

The invention includes invention compounds in 
unpurified, substantially purified and purified forms. It 
includes invention compounds that are present with any 
additional component(s) such as a solvent, reactant or 
by-product that is present during invention compound syn- 
thesis or purification, and any additional component(s) that 
is present during the use or manufacture of an invention 
compound. 

Halo and halogen mean F, CI, Br or I. 

Alkyl means unbranched, branched or cyclic hydrocar- 
bons that are saturated or unsaturated, or combinations 
thereof. Alkyl includes all isomers, e.g., stereoisomers, 
positional isomers, diastereomers and regioisomers. Alkyl 
moieties that are unsaturated will typically contain 1, 2, 3 or 
more — CH=CH — or — C^C — groups, usually one such 
group. 

Substituted aJkyl means unbranched, branched or cyclic 
hydrocarbons that are saturated or unsaturated, or combina- 
tions thereof, where the hydrocarbon contains a heteroatom 
linked to a carbon or a heteroatom that replaces a carbon 
atom. Substituted alkyl includes all isomers, e.g., 
stereoisomers, positional isomers, diastereomers and regioi- 
somers. Substituted alkyl moieties that are unsaturated will 
typically contain 1, 2, 3 or more — CH=CH — or — te— 
C — groups, usually one such group. Substituted alkyl 
includes alkyl groups having substituents linked to a carbon 
atom or substituents that interrupt a carbon atom diain, and 
unless otherwise defined, substituents include ethers 
(— O— ketones (^(0)—), — OR^, — C(0)OR^ -~-C(0) 
O— , >>~OC(0)— , ^(0)H, — OCH2— , — OCH.CH2— , 
— OCHsO— — OCH2CH2O— , — NR^— — NHR^ 
— NHC(0)— , — C(0)NH— , C(0)NHR^ — OC(0)NR^— , 
— 0C(0)NHR^ — NR^C(0)NR^— , — NR^C(0)NHR^ 
— NR^CH^— , — NR^CH.CH^— — S— — SR^ 
— S(0)— , -^(0)(0>--, ^(0)0R^ — S(0)H, halogen, 
CN, NO2, and combinations of these substituents where R^ 
is hydrogen or a protecting group. 

The invention compounds herein do not include obvi- 
ously unstable structures, e.g., — O — O — , — 0 — S — or 
unsaturated cyclopropyl, unless they are usefid as transitory 
intermediates in the preparation of more stable compounds. 

As used herein, "monosaccharide" means a poly hydroxy 
aldehyde or ketone having the empirical formula (CH20)„ 
where n is 3, 4, 5, 6 or 7. Monosaccharide includes open- 
chain and closed-chain forms, but will usually be closed 
chain forms. Monosaccharide includes hexofuranose and 
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pentofuranose sugars such as 2'-deoxyribose, ribose, 
arabinose, xylose, their 2'-deoxy and 3 -deoxy derivatives, 
their 2',3'-didcoxy derivatives, and their derivatives contain- 
ing R^ linked to the 2' or 3' position, usually the 2' position. 

5 Monosaccharide also includes the 2*3* dideoxydidehydro 
derivative of ribose. Monosaccharides include the D- and 
L-isomers of glucose, fiiictose, mannose, idose, galactose, 
aUose, gulose, altrose, talose, fucose, erj^rose, threose, 
lyxose, erythrulose, ribulose, xylulose, ribose, arabinose, 

10 xylose, psicose, sorbose, tagatose, glyceraldehyde, dihy- 
droxyacetone and their monodeoxy derivatives such as 
rhamnose. Monosaccharides are optionally protected or par- 
tially protected. 

As used herein, a "protecting group" means a moiety that 

15 prevents the atom to which it is linked from participating in 
unwanted reactions. For example, for — OR^, R^ is a pro- 
tecting group for the oxygen atom found in a hydroxyl or 
carboxyl group, for — SR^, R^ is a protecting group for 
sulfur in thiols for instance, and for — NHR^ or — N(R^) — , 

20 is a nitrogen atom protecting group for primary or 
secondary amines. Hydroxyl, amine and other reactive 
groups are found in invention compounds at, e.g., R^ R^^ or 
ol%onucieotide linkages or oHgonucleotide bases. These 
groups may require protection against reactions taking place 

25 elsewhere in the molecule. The protecting groups for 
oxygen, sulfur or nitrogen atoms are usually used to prevent 
unwanted reactions with electrophiHc compounds, such as 
acylating or phosphorylating agents used, e.g., in 
nucleoside, nucleotide or oligonucleotide chemistry. 

Protecting groups are intended to be removed by known 
procedures^ although it will be understood that the protected 
intermediates fall within the scope of this invention. The 
removal of the protecting group may be arduous or straight- 
forward, depending upon the economics and nature of the 
conversions involved. In general, one will use a protecting 
group with exocychc amines in the B groups of the com- 
pounds of this invention. For oligonucleotide containing 
such B groups to be fully binding competent, exocycHc 
amines must be deprotected because the amine groups 

^ participate in hydrogen bonding with complementary bases. 
Similarly, one will typically use reversible protecting groups 
for the 5' and 3' hydroxyl groups of pentofuranose sugars in 
nucleotides intended for use as monomers in synthesis of 
oligonucleotides containing 3,5' linkages. Protecting groups 
commonly are employed to protect against covalent modi- 
fication of a sensitive group in reactions such as 
phosphorylation, alkylation or acylation. Ordinarily, protect- 
ing groups are removed by, e.g. hydrolysis, elimination or 
aminolysis. Thus, simple functional considerations will suf- 

50 fice to guide the selection of a reversible or an irreversible 
protecting group at a given locus on the invention com- 
pounds. Suitable protecting groups and criteria for their 
selection are described in T. W. Greene and P. G. M. Wuts, 
Eds. "Protective Groups in Organic Synthesis" 2nd edition, 

^5 WHey Press, at pps. 10-142, 143-174, 175-223, 224-276, 
277-308, 309-405 and 406-454. 

Salts 

Embodiments include salts and complexes of invention 
60 compounds, including pharmaceutically acceptable or salts 
that are relatively non-toxic. The invention compounds may 
have one or more moieties that carry at least a partial 
positive or negative charge in aqueous solutions, typically at 
a pH of about 4-10, that can participate in forming a salt, a 
65 complex, a composition with partial salt and partial complex 
properties or other noncovalent interactions, all of which we 
refer to as a "salt(s)". Salts are usually biologically com- 
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patible or pharmaceutically acceptable or non-toxic, particu- 
larly for mammalian cells. Salts that are biologically toxic 
are optionally used with synthetic intermediates of invention 
compounds. When a water soluble composition is desired, 
monovalent salts are usually preferred. 5 

Metal salts typically are prepared by reacting the metal 
hydroxide with a compound of this invention. Examples of 
metal salts which are optionally prepared in this way are 
salts containing Li^, Na"^, and K"^. A less soluble metal salt 
can be precipitated from the solution of a more soluble salt 
by adding a suitable metal compoimd. Invention salts may 
be formed from acid addition of certain organic acids, such 
as organic carboxylic acids, and inorganic acids, such as 
alkylsulfonic adds or hydrogen halide acids, to acidic or 
basic centers on invention compounds, such as basic centers 
on the invention pyrimidine base analogs. Metal salts 
include ones containing Na*, Ii% K*, Ca"*^ or Mg*"*". Other 
metal salts may contain aluminum, barium, strontium, 
cadmium, bismuth, arsenic or zinc ion, 

Salt(s) of invention compounds may comprise a combi- 
nation of appropriate cations such as alkali and alkaline 
earth metal ions or ammonium and quaternary ammonium 
ions with the acid anion moiety of the phosphoric acid or 
phosphonic acid group, which may be present in invention 
polymers or monomers. ^ 

Salts are produced by standard methods, including dis- 
solving free base in an aqueous, aqueous-alcohol or 
aqueous-organic solution containing the selected acid, 
optionally followed by evaporating the solution. The free 
base is reacted in an organic solution containing the acid, in 3Q 
which case the salt usually separates directly or one can 
concentrate the solution. 

Suitable amine salts include amines having suflSdcnt 
basicity to form a stable salt, preferably amines of low 
toxicity including trialkyl amines (tripropylamine, 35 
triethylamine, trimethylamine), procaine, dibenzylamine, 
N-benzyl-betaphenethylamine, ephenamine, N,N'- 
dibenzylethylenediamine, N-cthylpiperidine, benzylamine 
and dicyclohexylamine. 

Salts include organic sulfonic acid or organic carboxylic 40 
acid salts, made for example by addition of the acids to basic 
centers, typically amines. Exemplary sulfonic adds include 
^6-16 aryl sulfonic acids, C^.^^ heteroaryl sulfonic acids and 
Ci_;i6 alkyl sulfonic acids such as phenyl sulfonic acid, 
a-naphthalene sulfonic acid, |3-naphthalene sulfonic acid, 45 
(S)-camphorsulfonic acid, methyl (CH3SO3H), ethyl 
(C2H5SO3H), n-propyl, i-propyl, n-butyl, s-butyl, i-butyl, 
t-butyl, pentyl and hexyl sulfonic acids. Exemplary organic 
carboxylic acids include C^.^g alkyl, Cg.^^ aryl carboxylic 
acids and C4_i^ heteroaryl carboxyHc acids such as acetic, 50 
glycolic, lactic, pyruvic, malonic, giutaric, tartaric, citric, 
fumaric, sucdnic, malic, maleic, oxaHc, hydioxymaleic, 
benzoic, hydroxybenzoic, phenylacetic, cinnamic, salicylic, 
nicotinic and 2-phenoxybenzoic. 

Invention salts include those made from inorganic adds, 55 
e.g., HF, HCl, HBr, HI, H2SO4, H3PO4, Na2C03, K2CO3, 
CaC03, MgCOg and NaQOs. Suitable anions, which are 
optionally present with a cation such a Ca"""^, Mg""^, Li^, Na"" 
or K^, include arsenate, arsenite formate, sorbate, chlorate, 
perchlorate, periodate, dichromate, glycodeoxycholate, 60 
cholate, deoxycholate, desoxycholate, taurocholate, 
taurodeoxycfaolate, taurolithocholate, tetraborate, nitrate, 
nitrite, sulfite, sulfamate, hyposulfite, bisulfite, 
metabisulfite, thiosulfate, thiocyanate, silicate, metasilicate, 
CN", gluconate, gulcuronate, hippurate, picrate, 65 
hydrosulfite, hexafluorophosphate, hypochlorite, 
hypochlorate, borate, metaborate, tui^tate and urate. 
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Salts also include the invention compound salts with one 
or more amino adds. Many amino acids are suitable, espe- 
cially the naturaliy-occurrir^ amino adds found as protein 
components, although the amino acid typically is one bear- 
ing a side chain with a basic or acidic group, e.g., lysine, 
arginine, histidine or glutamic add, or a neutral group such 
as glycine, serine, threonine, alanine, isoleucine, or leucine. 

The invention compositions include compounds in their 
un-ionized, as well as zwitterionic form, and combinations 
with stoiochimetric amounts of water as in hydrates. 

Stereoisomers 

The compounds of the invention include enriched or 
resolved optical isomers at any or all asymmetric atoms as 
are apparent horn the depictions. Both racemic and dias- 
teromeric mixtures, as well as the indi\ddual optical isomers 
isolated or synthesized so as to be substantially free of their 
enantiomeric or diastereomeric partners, are all within the 
scope of the invention. Chiral centers may be found in 
invention compounds at, for example, R^, and R^-^. 

One or more of the following enumerated methods are 
used to prepare the enantiomerically enriched or pure iso- 
mers herein. The methods are listed in approximately their 
order of preference, i.e., one ordinarily should employ 
stereospecific synthesis from chriral precursors before chro- 
matographic resolution before spontaneous crystalhzation. 

Stereospedfic synthesis is described in the examples. 
Methods of this type conveniently are used when the appro- 
priate chuai starting material is available and reaction steps 
are chosen do not result in undesired racemization at chiral 
sites. One advantage of stereospecific synthesis is that it 
does not produce undesired enantiomers that must be 
removed from the final product, thereby lowering overall 
synthetic yield. In general, those skilled in the art would 
understand what starting materials and reaction conditions 
should be used to obtain the desired enantiomerically 
enriched or pure isomers by stereospedfic synthesis. If an 
unexpected racemization occurs in a method thought to be 
stereospecific then one needs only to use one of the follow- 
ing separation methods to obtain the desired product. 

If a suitable stereospecific synthesis cannot be empirically 
designed or determined with routine experimentation then 
those skilled in the art would turn to other methods. One 
method of general utility is chromatographic resolution of 
enantiomers on chiral chromatography resins. These resins 
are packed in columns, commonly called Pirkle columns, 
and are commercially available. The columns contain a 
chiral stationary phase. 'ITie racemate is placed in solution 
and loaded onto the column, and thereafter separated by 
HPLC, See for example. Proceedings Chromatographic 
Society — Intemational Symposium on Chiral Separations, 
Sep. 3-4, 1987. Examples of chiral columns that could be 
used to screen for the optimal separation technique would 
include Diacel Chriacel OD, Regis Pirkle Covalent 
D-phenylglycine, Regis Pirkle Type lA, Astec Cyclobond II, 
Astec Cyclobond III, Serva Chiral D-DL=Daltosil 100, 
Bakerbond DNBLeu, Sumipax OA-1000, Merck Cellulose 
Triacetate column, Astec Cyclobond I-Beta, or Regis Pirkle 
Covalent D-Naphthylalanine. Not all of these columns are 
likely to be effective with every racemic mixture. However, 
those skilled in the art understand that a certain amount of 
routine screening may be required to identify the most 
effective stationary phase. When using such columns it is 
desirable to employ embodiments of the compounds of this 
invention in which the charges are not neutralized, e.g., 
where addic functionalities such as carboxyl are not esteri- 
fied or amidated. 
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Another method entails converting the enantiomers in the polycycle and is termed the polycyclic substructure. 
mixture to diasteriomers with chiral auxiharies and then consists of two principal structural features denominated 
separating the conjugates by ordinary column chromatog- — A(Z)^,. Group A is a spacer that is used to position the Z 
raphy. This is a very suitable method, particularly when the group(s) and attach it to the remainder of the polycycle. The 
enibodiment contains free carboxyl, amino or hydroxyl that 5 Z group(s) serve as a site for attachment of a detectable label 
will form a salt or covalent bond to a chiral auxiliary. or to enhance the hydrogen bonding of the polycycle to the 
Chirally pure amino acids, organic acids or organosulfonic complementary guanine base. In some embodiments, Z is 
acids arc aU worthwhHc exploring as chiral auxHiaries, all of capable of performing both functions. For the most part, Z 
which are well known in the art. Salts with such auxiliaries groups capable of hydrogen bonding are useful as sites for 
can be fonned, or they can be covalently (but reversibly) 10 covalent bonding to detectable labels, but not all Z groups 
bonded to the functional group. For example, pure D or L that are useful as label-bonding sites are capable of hydro- 
ammo acids can be used to amidate the carboxyl group of gen bonding to guanine. Z contains at least one atom other 
embodiments of this invention and then separated by chro- than carbon, typically O, N or S. In any case, the R^ groups 
matography. Enzymatic resolution is another method of possess at least one of these practical utiHties. It would be 
potential value. In such methods one prepares covalent 15 routine to make and test them to determine the best use of 
derivatives of the enantiomers in the racemic mixture, any one embodiment 

generaUy lower dkyl esters (for example of carboxyl), and z gioupscapable of base-pairing arebeUeved to hydrogen 

JvZwTf th T'.f f^^J™^'''^ fl^^^^g^' S^-^^^Uy bond with of guanine in a complementary micleic add 

nyuroiysis rorimsmetnooio Desucce^anenzymemnst sequence when incorporated into a polycyclic substructure- 

be chosen that is capable of stereospeafic cleavage, so it is 20 substituted oKgonucleotide. The resulting duplex has greater 

^TS^h'T^^r '''tT r'r'-" stabimythanonecontaimnganativeGCpairbecausetheR^ 

esters are to be deayed, then one selects a group of est^es, g^^up provides an additional point for hydrogen bonding to 

fhe^eHv^J' ^^r". ^^^"T^^}^^ '^*=''^»y «° the complementary guanine b^. Tlius/these embodiments 
the denvative. Topical esterases are from hver, pancreas or serve as cytosine surrogates for supplemented Watson-Crick 
other anmial organs, and mclude porcme liver esterase. 25 base-pairing. In general, a base-pairing substituent Z is 
If the enatiomeric mixture separates from solution or a defined functionaUy as any group that, when taken together 
melt as a conglomerate, i.e., a mixture of enantiomerically- with the remainder of is capable of increasing the 
pure crystals, then the crystals can be mechanically temperatureofmeltingof anyof theo%onucIeotides3-9in 
separated, thereby producing the enantiomerically enriched Table 1 by at least about 2 degrees Centigrade when sub- 
preparation. This method, however, is not practical for large 30 stituted as shown in Table 1. 

scale preparations and is of no value for true racemic if d2 . . • u .-^ . .u . ■ r.. ^ 

compounds contam a substituent that is capable of 

A«»v.™.Lv • .u . u • t u- • contributing to base pairing or hydrogen bonding then is 

Asymmetnc synto b another techmque tor achievmg usefiil at least as a point of attachmeni for a detectable label, 

enantiomenc emichment. For example, a chiral protecUng such Z groups need only be reactive with a bifimctional 

group IS reacted with the group to be protected and the 35 ^.^ ,^ 

Z^LfZT T.^ T ^^T- }^ '^V^'^'^^K embodiments, the polycyclic substructure is itself 

Tthlt™ omer fluorescent, and in the^ 4^ it is not necessary to link the 

. r ^ groiq) to a detectable label. In these embodiments the 

Further guidance m the separation of enanUomeric mix- polycyclic substructure is detectable by fluorescent 

toes can bo foimd, by way of example and not limitation, in ^ emissions, or by adsorption and energy transfer to an emit- 

bmmtiomere, Racemates, and resolutions", Jean Jacques, ting (second) label present on a binding partner in the same 

Andre Collet, and &muel H. Wflen (Krieger Publishing fashion as is used in EMIT technologies well-known in the 

Company, Malabar, Fla., 1991, ISBN 0-89464-618-4): Part diagnostics field 

2, Resolution ofEnantiomer Mixture, pages 217-435: more c. a- u . j -.i. r . . r, 

pardcularly, section 4, Resolution by Direct CrystalH^ation, « „ fPff^"^ f'^^'Tf ""^k^?- ^ ^ ^ ^^^^.^ 

pages 217-251, section 5, Fomiation and Separation o Lfr™Jlv,"^Tt^^'^^\ T^^^ T 

Diastereomers, pages 251-369, section 6, CryLlization- P^f^^Wy 1 or 2. orfmanly 1. Snnda^^^^ 

Induced Asymmetric Transfomiations, pages 369-378; and :^AX'^^^T'.''!^f'fL ft ^^""^rt^'r!?-^^ 

section 7, Experimental Aspects and of Resolutions, ^P^**^ ^when R is intended to function as a label bondmg 

pages 378-435; still more particularly, section 5.1.4, Reso- 

lution of Alcohols, Transformation of Alcohols into Salt- ^° embodiments the spacer group A and the Z 

Forming Derivatives, pages 263^266, section 5,2.3, Cova- substituent(s) will interact functionally, i.e., changes in 

lent Derivatives of Alcohols, Thiols, and Phenols, pages ^""P ^ ^^^^ ^ impact on the physical or chemical 

332-335, section 5.1.1, Resolution ofAcids, pages 257-259, Properties of Z, and vice-versa. For example, it will be 

section 5.1.2, Resolution of Bases, pages 259-260, section ^5 understood by those skilled in the art that changes can be 

5.1.3, Resolution of Amino Acids, page 261-263, section introduced in spacer A that would reduce or increase the 

5.2.1, Covalent Derivatives of Acids, page 329, section ^^^^^^ ^ hydrogen bond or to react with a label or 

5.2.2, Covalent derivatives of Amines, pages 330-331, cross-linking agent. A readily apparent instance of this 
section 5.2.4, Covalent Derivatives of Aldehydes, Ketones, ^^^^^ substitution of A with electron donors or acceptors 
and Sulfoxides, pages 335-339, and section 5.2.7, Chro- proximal to a Z group, which may affect hydrogen bonding 
matographic Behavior of Covalent Diastereomers, pages ^^etween Z and guanine. However, it is conceptually useful 
348-354. to consider these domains to be functionally and structurally 

_ , ^ discrete taJdng into account interdomain interactions that 

Compounds of Structure (1)— PolycycUc would be apparent to the ordinary artisan. 

65 Spacer A typically contains a backbone chain of 2, 3, 4, 

R IS a key functionaUty. It is substituted on the polycycle 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 or 16 carbon atoms, any 

depicted m structure (1), less R^, The combination of the 1, 2 or 3 of which are optionally replaced with N, O or S 
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atoms, usually 1 N, O or S atom. The backbone chain refers 
to the atoms that connect the Z group(s) to the ring carbon 
atom at the binding site on the polycyclc. The number of 
spacer backbone atoms does not include terminal Z group 
atoms. does not include protected amine as described in 5 
U-S. Pat. No. 5,502,177. 

The spacer A backbone is linear or one or more backbone 
atoms are substituted, which results in branching. 
Ordinarily, when 1 Z group is present then A will contain a 
linear backbone of 2 to 8, usually 2 to 4 atoms. The 10 
backbone generally is carbon only, bonded by saturated or 
unsaturated bonds. If unsaturated bonds are present, the 
backbone generally will contain 1 to 2 double or triple 
bonds. Preferably, the backbone is saturated. If a heteroatom 
is present in the backbone it typically will be 0 or S. 15 
Preferably the heteroatom is O, and preferably only 1 0 is 
present in the backbone chain. Heteroatoms are used to 
replace any of the backbone carbon atoms, but preferably are 
used to replace the carbon atom alpha (adjacent) to the 
polycyclic ring. Usually the atom in the spacer chain that is 20 
bonded to the polycyclic substructure is unsubstituted, e.g., 
— 0 — , — S — , — ^NH — or — CH2 — , and, in general, the 
next 1, 2 or 3 atoms in the spacer are unsubstituted carbon. 

The spacer A backbone is optionally substituted indepen- 
dently with 1, 2 or 3 of the following: C^-Cg alkyl, ^R^, 
=0, — NO2, — N3, — COOR^ — N(R% or — CN groups, 
Ci-Cg alkyl substituted with —OH, =0, — NO^, — N3, 
— COOR^ — N(R% or — CN groups, or any of the fore- 
going in which — CHj — is replaced with — 0 — , — ^NH — 
or — N(Ci-C8 alkyl), wherein R^ is H or a protecting group. 
Certain of these groups may function as Z sites for linking 
to detectable labels, but need not be used for that purpose 
unless desired. In some embodiments these substituents are 
useful in increasing the lipophilicity of the compounds of 
this invention. 

Group Z detectable labels include all of the conventional 
assayable substances used heretofore in labeling oligonucle- 
otides or proteins. Examples are well known and include 
fluorescent moieties such as fluorescein, chemiluminescent 
substances, radioisotopes, chromogens, or enzymes such as 
horseradish peroxidase. For the purposes herein, the residue 
of any bifunctional or multifunctional agent used to 
crosslink the Z group(s) to the A backbone is defined to be 
part of the Z group, and the residue of the detectable label 
is considered also to represent part of Z. 

Group Z also encompasses substituents that are not 
detectable by conventional diagnostic means used in clinical 
chemistry settings (e.g., UV or visible light absorption or 
emission, scintillation or gamma counting, or the like) but 50 
which are nonetheless capable of reacting with a crosshnk- 
ing agent or a detectable label to form a covalent bond. In 
this regard, the Z groups function as interaiediates in the 
synthesis of the labelled reagent. Typical Z groups useful for 
this purpose include — NH2, — CHO, — SH, — COjY or OY, 55 
where Y is H, 2-hydroxypyridine, N-hydroxysuccinimide, 
p-nitrophenyl, acylimidazole, maleimide, trifluoroacetate, 
an imido, a sulfonate, an imine 1,2-cyclohexanedione, gly- 
oxal or an alpha-halo ketone. Suitable spacers, reactive 
groups and detectable labels have been described, e.g., U.S. 5Q 
Pat. Nos. 5,668,266, 5,659,022, 5,646,261, 5,629,153, 
5,525,465 and 5,260,433, WO 88/10264, WO 97/31008, EP 
063 879 Bl, Urdea "NAR" 16:4937-4956 (1988), Prober 
"Science" 238:336-341 (1987). 

Z also is a hydrogen bond donor moiety or a moiety, when 65 
taken together with the influence of spacer A, has a net 
positive charge of at least about +0.5 at pH in aqueous 



solutions. Such Z groups are designated R^. In these 
embodiments, R^ is covalently Hnked to a short spacer A 
having a backbone (otherwise described above) of 2, 3, 4, 5 
or 6 atoms, designated R^. 

The R^ short spacer chain backbone atoms are C atoms 
and optionally one or two atoms independently selected 
from the group consisting of O, N or S atoms. R^^ short 
spacer chain backbones include unbranched and branched 
alkyl that optionally contain one or two independently 
selected 0, N or S atoms. Usually R^ is unbranched, i.e. the 
backbone has no hydrocarbon substituents. Any branching, 
if present, will usually consist of a C1-C3 alkyl group, 
usually a methyl or ethyl group, or C^-Cg alkyl substituted 
with —Oil, =0, — 0(Ci~C3 alkyl), -—CN, N3 or 1, 2, 3 or 
4 halogen atoms. 

Exemplary — R^^— R^ and related structures are 

(a) --RMCH,),— NR^qNR^)(NR% including — 0— 
(CHA— NR^C(NR^)(NR^)., —NH-HpH^) — NR^C 
(NR^)(NR\ and ^CH2)2''5NR5C(NR^)(NR%, 

(b) — R^— CH2— CHR^'— N(R^)2, 
-R^-(R\-N(R^)^, ^RMCH^),N(R^)2, 
-(CH2)^(R% -CH^-O-^CH^) -N(R\, 




N(R3)2,and 



(40) 



(41) 



4n 



45 



where R^ is usually — O — , 




^R' 



i2,or 



(42) 



(43) 



(44) 



(45) 



(46) 



) 



N(R3)2, 



where R^ is usually —O— and R^ is usually — CII2— or 
— CH2CH2— in structtires (42)-<46) and adjacent R^ and R^ 
are not —0—0—, — 0-— S — or S— ; 
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is independently — H, — CH3, 
— (CHj)^ — ^N(R^^2 ^ protecting group, usually — 

or — CH3, 

or, both together are joined to form a protecting group, 
or, when R^ is —R%CH2)M^%, one R^ is H, CH3, 
CH2CH3, a protecting group or —(CU^^—N(R^\ 
and the other R^ is — H, — CH3, — CH2CH3, 
-^Cn^^—N(R^%, ^H(N[R^^]2)--N(R^% 



(48) 




.or 



(49) 



(50) 




H 



usually when one R^ is — H, — CH3, — CH2CH3, or 
— (CH2)v— N(R^^)2, the other R^ is — H or a protecting 
group; 

is independently — H or a protecting group; 
is independently — S— , — NR^— , — O— or 

— CH2 — ; 

R^ is independently linear alkyl having 1, 2, 3 or 4 carbon 
atoms, linear alkyl having 2, 3 or 4 carbon atoms and 
containing one — CH=CH — , — C^C — or — CH2 — 
0 — CH2 — moiety, or R'^ is cychc alkyl having 3, 4 or 
5 carbon atoms, wherein one of the linear alkyl carbon 
atoms is optionally substituted with a single — CII3, 
— CN, =0, — OH or protected hydroxyl, provided that 
the carbon atoms in any — CH=CH — or — CH2 — 
0 — CH2— moiety are not substituted with =0, — OH 
or protected hydroxyl, and usually R^ is — CH2 — , 
— CH2— CH2— , -~CH2— CH2-<3i2— or — CH2— 
CH2 — CH2 — CH2 — ; 

R^ is linear alkyl having 1 or 2 carbon atoms wherein one 
of the linear alkylene carbon atoms is optionally sub- 
stituted with a single — CH3, -~€N, =0, —OH or 
protected hydroxyl, or R® is absent and R^ is linked 
directly to the riog in R^ structures (42)-{46), usually 
R^ is — CH2— or --CH2— CH2— ; 

R^^ is independently — CH.— —CU(Ci{^)— -~-CH 
(OCH3)— , — CH(OR^)~ or —0—, but both are not 
-0-; 

R^ is independently — N— , — N(CH3)— , — CH— , 
— CXCHg)— , but both are not — N(CH3)— ; 

R^° is — H or N(R^)2, usually — H or — NH2; 

R^^ is the side chain of an amiQo acid, usually the side 
chain of a naturally occurring amino acid, e.g. glycine, 
alanine, valiae, isovaline, leucine, threonine, serine, 
lysine or arginine; 

R^^ is independently — H, — CH3, — CH2CH3 or a pro- 
tecting group; 

R^^ is H, C^-C^ alkyl (including — CH3, — CH2CH3) or 
a protecting group, usually — H or a protecting group; 



R^^ is — ^H, — CH3, — CH2CH3, a protecting group, a 
monosaccharide, where the monosaccharide is usually 
linked at the monosaccharide's 1' position and where any 
monosaccharide hydroxyl groups are optionally protected, 
typically an R^^ monosaccharide is 2^-deoxyribose, a 
2'-deoxy-2-R^^ -substituted ribose or arabinose such as 
2'-deoxy-2*-fluororibose or 2'-deoxy-2'-fluoroarabinose, or 
the monosaccharide is ribose or arabinose; 
10 t is 1, 2, 3 or 4, but when R^ is — O — , — S — or 
— NR^— , t is 23 or 4; 
v is independently 0, 1 or 2; and 
w is 1 or 2. 

Invention embodiments include R^ moieties having the 
15 structure — R^^— NHj where R^^ has the structure — R^— 
R^— , inchiding — R^— {CH2)/— N(R3)2, where R^ is usu- 
ally — 0-~, ~-S— , — NH— or — CH2— , R^° is 
— CHR^MCHR^')^3-~<R^^)2i--(CHR^')^2— CHR^'— 
where R^' is —0— C(0), — CHR^ or — NR^— and 

usually 0, 1 or 2 R^^ are methyl or ethyl; R^^ independently 
is — ^H, methyl or ethyl; Z3 is 1, 2 or 3, usually 1; Zl is 0 
or 1, usually 0; and Z2 is 1, 2 or 3, usually 1. In these 
embodiments, any functional groups, e.g., — OH, — ^^^H2, 
25 — COOH, or — ^SH, that are optionally present at R^^ are 
usually protected. These embodiments are useful as inter- 
mediates useful to make monomers for oligonucleotide 
synthesis. 

Other invention embodiments include R^ moieties having 
the structure 
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including (CHj)!-*' 



NR5 

^R«— R^— NH ^ NHR^ 

including (CH2)i-5NH ^ ^NHR^; 

r29_r29 
J/ \ ^Oi 



H 



including h 



N 
R" 



■NH ^R«»— r6 



, including 



NH(CH2)i_2-R* 



R" 




; and 



R*°-R* , including 
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where R*'^ is — ^H, alkyl having 1, 2, 3 or 4 carbon atoms or 
optionally protected substituted alkyl having 1, 2, 3, 4, 5 or 
6 carbon atoms including — CH3 and — CH2CH3, and R^^ is 
—H, — NH2 or — NH(CH3). Other embodiments are 
— RMCH2)2.8— NH2, — RMCH2)2.8— OR^ or — R^— 
(CH2)2.8-C02R^. 

Compounds of Structure (1) — Substituent R^ — 
Linker 

One uses R^ linker groups to covalently bond the inven- 
tion base to the selected binding partner, although it wiU be 
understood that this need not be the sole use for the Hnker 
functionality. Hius, a group present in R^ linkers principally 
serves as the site for covalently bonding the invention base 
to a binding partner, typically by incorporating the invention 
base via the Hnker residue into a polymeric binding partner 
by grafting or copolymerization. 

R^ linkers also optionally are substituted with groups that 
ordinarily will not participate in binding to the binding 
partner, e.g., halo, azido and protected hydroxyl. Generally, 
such Ikiker groups will contain from 2 to about 50 atoms. If 
it contains a cycle the cychc functionality typically will be 
an oxygen, sulfur or phosphorus-containing saturated or 
unsaturated heterocycle having a total of about from 5 to 7 
ring atoms and 1 to 3 heteroatoms. For the most part, the 
cycle will be a monosaccharide, typically (i) a hexose, (ii) a 
hexose such as glucose substituted with phosphate, pro- 
tected phosphate, hydrogen phosphonate, a 
phosphoramidate, hydroxyl or protected hydroxyl, (iii) a 
furanose or (iv) a furanose substituted with phosphate, 
protected phosphate, hydrogen phosphonate, a 
phosphoramidate, hydroxyl or protected hydroxyl. Typical 
furanose sugars include ribose, 2 -deoxyribose, 2'-deoxy-2'- 
R^^-substituted ribose and their 2' ara isomers. Ordinarily, 
R^ is an abasic nucleotide residue or such a residue deriva- 
tized so as to be capable of incorporation into an oligonucle- 
otide. 

Thus, the R^ linker frequendy comprises an activated 
group or other group which can react with a polymer or other 
binding partner to be labeled with the polycyclic substruc- 
ture. For example, groups described below that are compat- 
ible with commonly available oligonucleotide synthetic 
chemistries are useful Other examples of reactant groups for 
covalent labeling are well-known from the diagnostic fields 
and have heretofore been used commonly to label proteins 
and oligonucleotide probes, as is more fidly discussed 
below. 

In one embodiment, R^ is a bifunctional or multifunc- 
tional organic linker group such as alkyl, alkene, alkyne, 
alkoxyalkyl, alkylthioalkyl, alkoxy, saturated or unsaturated 
heterocycle that is substituted with at least one group 
capable of being crosslinked with or incorporated into a 
polymer, e.g., such groups as hydroxy, amino, carboxyl, 
vinyl, phosphate or phosphonate. U.S. Pat. No. 5,502,177 
describes suitable Hnker groups. An example of such an R^ 
Hnker suitable for oligonucleotide synthesis is protected 
monosaccharides, such as ribofuranose and deoxyribofura- 
nose sugars of structure (5) 
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(5) 

where an invention base is Hnked to the open valence at the 
1' position, D is hydroxyl, protected hydroxyl or is an 
oHgonucleotide coupling group and is independendy R^^ 
or an oligonucleotide coupling group, but both are not 
coupling groups. 

In embodiments of the invention where the compound of 
structure (1) is to be used as a monomer in the preparation 
of oHgonucleotides, R^ is typically stmcture (5) where one 
is an oligonucleotide coupling group and D is — OH or 
protected hydroxyl. 
"CoupHng group" as used herein means any group suit- 

20 able for generating a phosphodiester linkage or phosphodi- 
ester substitute linkage between nucleotide bases or their 
analogs. TTiese coupling groups are conventional and well- 
known for the preparation of oHgonucleotides, and are 
prepared and used in the same fashion here. They are usually 

25 configured as the b anomers as denoted in structure (5) or 
optionally as the alpha anomers. In general, each compound 
comprising stmcture (5) wiU contain two coupling groups: D 
or D\ but with only one being a coupling group. The 
coupHng groups are used as intermediates in the preparation 

30 of 3',5' 5',3', 5',2' and 2',5' intemucleotide Hnkages in accord 
with known methods. 

Suitable coupHng groups for phosphodiester Hnkages or 
phosphodiester substitute links^es containing phosphorus 
include OH, H-phosphonate; (for amidite chemistries) alky- 

35 Iphosphonamidites or phosphoramidites such as 
P-cyanoethylphosphoramidite, N,N-diisopropylamino-P- 
cyanoethoxyphosphine, N,N-diisopropylamino- 
methoxyphosphine, N,N-diethylammo-methoxyphosphine, 
N,N-diethylamino-|3-cyanoethoxyphosphine, 

40 N-morphoHno-p-cyanoethoxyphosphine, N-morpholino 
methoxyphosphine, bismorpholino-phosphine, N,N- 
dimethylamino-p-cyanoethylmercapto-phosphine, N,N- 
dimethylamino-2,4-dichlorobenzylmercaptophosphine, and 
bis(N,N-diisopropylamino)-phosphine; and (for triester 

45 chemistries) 2-, or 4-chlorophenyi phosphate, 2,4- 
dichlorophenyl phosphate, or 2,4-dibromophenyl phosphate. 
See for example U.S. Pat. Nos. 4,725,677; 4,973,679; 4,997, 
927; 4,415,732; 4,458,066; 5,047,524; 4,959,463; 5,624, 
621; and International PubUcation Nos. WO 97/14706 and 

50 WO 92/07864. 

For structure (2) embodiments, if is a coupHng group 
then D typically wiH be hydroxyl protected with a group 
suitable for ensuring that the monomer is added to the 
oHgonucleotide rather than dimerizing. Such groups are well 

55 known and include DMT, MMT, FMOC 
(9-fluorenylmethoxycarbonyl), PAC (^)henoxyacetyl), a tri- 
alkyl (C;i~Cg alkyl, each alkyl group is independently 
chosen) silyl ether or an alkyl (C^-Cg alkyl) diaryl (e.g., 
phenyl) silyl ether such- as TBDMS (t-bu^ldiphenylsilyl) 

60 and TMS (trimethylsilyl). The opposite will apply when one 
desires to synthesize an oHgonucleotide in the opposite 
direction (5'^3% Ordinarily in structure (5) compounds, D 
is DMT, D^ is located on the 3' carbon, R^^ is H and the D^ 
and R^^ groups are in the alpha anomer conformation. 

65 As noted above, includes an optionally protected 
monosaccharides of structure (4) and (4A). UsuaUy the 
monosaccharides in structure (4) and (4A) compounds are 
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2'-deoxyribose, 2'-deoxy-2'-R^^-substituted ribose, 
2'-deoxy-2-R^-^ -substituted arabinose, ribose or arabinose, 
any of which arc optionally protected at sugar all or some 
faydroxyls or at optionally present R^^ functional groups 
such as — OH — SH or — NH2 groups. 

Invention embodiments include compositions of the for- 
mula 




10 
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where B is a structure (3), (4) or (4A) base, R^^ is a diene 
or a dieneophile, both as defined in WO 97/14706, R^'^ is 
— OR^, a coupling group includii^ — OH, H-phosphonate, 
a phosphoramidite or an optionally protected oligonucle- 
otide having a 3*-terminal group selected from a coupling 
group and — OR^ and any reactive moiety in R^^ is option- 
ally protected. R^^ dienes are independently chosen and 
include 2,4-hexadiene and 3,5-hexadiene. One or both R^^ 
are Hnked to a solid support such as a crosslinked organic 
polymer, polystyrene, Tentagel"^", polyethylene glycol or an 
inorganic oxide such as silica gel, alumina, controlled pore 
glass or a zeolite. These compositions are useful for making 
oligonucleotides containing one or more invention bases. 

Invention embodiments include compositions and their 
isomers of the formula 



N(R5)(R58) 



N(R^)(r5«) 



where R^^ is independently — ^H, a protecting group or both 
R^'^ together are a dihydroxy protecting group, R^^ is — H or 
alkyl containing 1, 2, 3 or 4 carbon atoms and B is a structure 
(3), (4) or (4A) base. Suitable R^ at the nitrogen atom 
include — H, FMOC and tBOC (t-butyloxycarbonyl) and 
suitable R^ at the carboxyl group include — ^H, t-butyl and 
benzyl, see, e.g., WO 97/14709. These compositions are 
useful for making oligonucleotides containing one or more 
invention bases. 

Protecting groups suitable for use with amine groups that 
may be present at R^ include FMOC and trichloroacetamide. 
Monomers and polymers may contain such protecting 
groups at R^. 

Substituent — ^Binding Partner 

R^, when functioning as a binding partner, is a substance 
that non-covalently binds to a target compound. Generally, 



the target compound is an analyte whose presence is desired 
to be detected. Binding partners are well-known from the 
immunoassay art and include hapten-antibody pairs such as 
those used in drug immunoassays using EMIT or ELISA 
technologies. Blading partners are employed analytically in 
enzymology, where the substrate or the enzyme is labeled. 
Binding partners also are known from the oligonucleotide 
hybridization art, including oligonucleotide-nucleic acid 
binding partners (as in diagnostic probes or therapeutic 
antisense oligonucleotides) or oligonucleotide-protein bind- 
ing partners (aptamers). In accordance with this invention, 
an invention base is substituted at R^ by any binding partner. 
While the binding partner may be a small molecule such as 
a drug, hapten, substrate or the like, ordinarily it is a 
polymer. 

Compounds of structure (1) wherein R^ is a polymer are 
an important feature of this invention. For the most part, 
when R^ is a polymer an R^ linker group has been subsumed 
into the polymer structure, either as a monomer unit or by 
grafting onto pre-existing polymer. Therefore, when R^ is a 
polymer, the polymer may comprise the residue of a linking 
group derived from a monomer or where the linking group 
differs from the polymer's monomeric subunits. The inven- 
tion base must be covalently linked to the polymer. 

The nature of the polymer is not critical. Typical R^ 
polymers include a biopolymer such as an oligonucleotide, 
a protein (including antibodies, enzymes, cell membrane 
proteins, glycoproteins, glycolipids, lipoproteins and 
nucleoproteins), a peptide, a nucleic acid, or a glycan or 
other polysaccharide or carbohydrate. In certain embodi- 
ments the polymer is an ohgonucleotide in which either or 
both of the sugar or phosphodiester monomer subunits are 
substituted by groups that continue to permit base pairing by 
the invention base analogs but which have other desirable 
characteristics that are not shared with native substituents, 
e.g., those which mask the negative charges of the phos- 
phodiester linkages or replace the phosphodiester linkage 
with another group. 

The site at which one links the invention base analogs to 
a polymer is typically not critical. In general, any reactive 
group on the polymer is satisfactory when one wants to graft 
the polycycle-R^ substructure onto a pre-existing polymer. 
Obviously, the site of the substitution should not be in a 
location in which the polycycle-R^ substructure will inter- 
45 fere with the intended function for the polymer, e.g. enzyme 
active site, antibody CDR, and the like as will be understood 
by the artisan. An amino acid side chain such as that of 
lysine, glutamic acid, serine, asparagine and the like will be 
satisfactory for grafting to protein R^, as ^^ill alpha amino 
groups, provided that the amino acids in question do not 
participate in the binding partner or ligand/subsirate inter- 
action involved in the assay in which the labeled protein is 
to be used. One appHes the same reasoning to select a 
binding site or sites on other analytes such as sugars, 
glycans, lipids, and the like. For example, the 1' position of 
ribose or deoxyribose is satisfactory as the site of substitu- 
tion of an oligonucleotide by the invention base analogs. 
Suitable sites will be known to the artisan, particularly in 
those instances where the one intends to substitute an 
invention base analog for purine or pyrimidine bases, usu- 
ally for cytosine, or for fluorescent labels. 

The degree of polymer substitution by the invention base 
analogs is not critical. One skilled in the art will choose the 
reaction conditions such that the resulting labeled polymer 
will be substituted with suf&cient molar proportion of base 
analog to facilitate its use in the desired analytical, thera- 
peutic or preparative procedm*e. This is accomplished by 
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preparing the labeled polymers under a variety of conven- 
tional conditions, e.g., the time, temperature or duration of 
the labeling reaction, to yield a matrix of multq)ly-labclcd 
polymers. These then are screened for suitability in the 
intended application. Molar ratios of about from 1:1 to 10:1 
invention base to polymer generally are suitable. Where the 
labeled polymer is prepared by monomer incorporation, the 
resulting polymer may contain about from 1% to 100% 
invention base analog substitution. In this embodiment each 
invention base is considered a monomer unit (even though 
the polymer may have been assembled from intermediate 
synthons containing 2 or more invention bases per synthon). 

Oligonucleotides are polymers containing at least 2 
covalently linked nucleotides or nucleotide analogs 
(collectively monomers), at least one of which comprises an 
invention base. In ohgonucleotide invention embodiments at 
least one invention base is covalently linked to a nucleotide 
sugar, and typical invention oligonucleotides will contain 
about 2-75% of the bases as invention base analogs, usually 
about 5-25%. Small oligonucleotides, e.g., 2-6-mers, that 
serve as synthetic intermediates for larger oligonucleofides 
win optionally contain the higher proportions of invention 
base analogs, e.g., about 50-75%. Larger oligonucleotides, 
e.g., about 7-21 -mers, will generally contain 1, 2, 3, or 4 
invention bases, occasionally 5 and usually not more than 
about 5 invention bases, unless the oligonucleotide is rela- 
tively long, e.g., about 22-50-mer. 

Invention embodiments include polymers and ohgonucle- 
olides where the invention bases are located on 2, 3 or more 
adjacent monomers or nucleotide residues, or the invention 
bases may be located on monomers or nucleotide residues 
that are separated from each other by about 1, 2, 3, 4, 6, 8, 
10, 12, 15, 18 or more monomers or nucleotide residues that 
do not contain these bases. When a detectable label is Hnked 
to an invention base at R^, the ohgonucleotide may contain 
1, 2 or 3 of these labeled monomers, usually 1 or 2. Such 
labelled monomers are often located at the 3' or 5' terminus, 
but they may reside at an internal position such as one, two 
or more monomer residues from either terminus. 

Invention oligonucleotides, which contain 1, 2, 3 or more 
invention bases, will typically have sufiScient binding affin- 
ity for complementary nucleic acid sequences to allow facile 
detection of the duplex or triplex resulting from the base 
sequence-specific binding interaction. Typically, an inven- 
tion oligonucleotide will have a Tm of at least about 15® C, 
usually at least about 20"* C, when tested under typical in 
vitro binding conditions, such as those described herein and 
elsewhere, (Jones, "J Oi^ Chem" [hereafter "JOC" ] 
58:2983-91 1993, Froehler, "TeL. Lett." 34:1003-06 1993), 
Complementary nucleic acid means a natural or synthetic 
compound that is capable of forming a hydrogen bonded 
complex in a sequence-specific manner with an invention 
oligonucleotide such as a structure (2) ohgonucleotide. 
Complementary nucleic acid base sequences contain no 
mismatches, while "substantially complementary" base 
sequences contain only a limited number of mismatches, 
e.g., at most about i mismatch per about 15-20 bases, 
relative to an invention ohgonucleotide. 

One optionally measures the binding of an invention 
ohgonucleotide to a complementary nucleic acid by detect- 
ing or measuring a Tm, by detecting the presence of a label 
present on the invention oligonucleotide or on the comple- 
mentary nucleic acid (after separating bound invention oli- 
gonucleotide from unbound invention oligonucleotide), by 
amplif jdng nucleic acids containing a region(s) complemen- 
tary to an invention oligonucleotide and so forth. Because of 
this, invention oligonucleotides optionally include species 
containing one or more modifications that decrease binding 
affinity, while the oligonucleotide still retains sufficient 
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binding aflSnity for a given apphcation. In addition, embodi- 
ments include short oligonucleotides or ohgonucleotide 
domains, e.g., having about 2, 3, 4, 5 or 6 Hnked monomers, 
where the cfomain may have low binding afBnity, but even in 
this case are useful as intermediates to make longer oligo- 
nucleotides so as to increase affinity sufficiently to confer a 
Tm of at least about 15° C. Generally, invention oligonucle- 
otide analogs will contain about 40% or less, usuaUy about 
25% or less, of monomers that significantly reduce binding 
affinity, i.e., monomers that decrease the Tm more than 

* about 2* C. per monomer, compared to a conesponding 
unmodified oligonucleotide. 

Invention embodiments include protected, partially pro- 
tected and deprotected monomers and polymers including 
ohgonucleotides. Partially protected compounds arise dur- 

' ing the course of deprotection and they are thus intermedi- 
ates in the process of preparing deprotected compounds. 
Typically, one would not recover partially deprotected com- 
pounds. Deprotected compounds have been subject to a 
treatment that removes the protecting group(s), although the 

I preparation may contain some compounds with unremoved 
protecting groups. Typically, any remaining protecting 
groups that remain after deprotection are present in small 
amounts that may be removed by suitable purification meth- 
ods if desired. 

Invention embodiments include oligonucleotides of struc- 
ture (2) where R^"^ is oxygen. Invention oligonucleotides, 
including those where R^"^ is oxygen, typically contain 2, 3, 
4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 
22, 23 or 24 linked monomers usually about 5-21. Such 
oligonucleotides optionally contain about 30-100%, typi- 
cally about 60-100%, of the linkages as phosphodiester or 
phosphorothioate Hnkages, or other hnkages of similar bind- 
ing afBnity. 

Invention oligonucleotides include support-bound 
oligonucleotides, which are typically used in soHd phase 
synthesis and separation applications. Support-bound oligo- 
nucleotides are typically protected during synthesis, e.g., 
bases, sugar hydroxyls, linkages and functional groups 
optionally present are protected as needed, e.g., a sugar 
hydroxyl group present at R^, an amine group at R^ or a 
hydroxyl or amine group at R^^. In these embodiments, R^ 
is covalently linked to a solid support or R^ is an oligo- 
nucleotide linked to a sohd support. When one removes 
invention ohgonucleotides from a support, the protecting 
groups are generally removed at the same time or shortly 
thereafter. 

Invention embodiments include highly lipophilic poly- 
mers and ohgonucleotides that comprise (i) one or more 
structure (3) bases, usually about 1, 2, 3 or 4, and (ii) 
Hpophihc modifications such that the polymer or oligonucle- 
otide has an octanohwater partition coefficient of about -0.5 
to about 2.5, typically about 0,0-2.0, usually about 0.2-1.5, 
and a solubihty in water of at least 0.001 jug/mL, usually at 
least 0.1 ^g/mL. 

One can use such highly lipophilic polymers and oligo- 
nucleotides as reagents to stain, detect or visualize hving 
cells in vitro or in vivo, as described in U.S. Pat. No. 
5,633,360 and in Application No. PCT US 96/12530. These 
highly Hpophihc polymers and oHgomicleotides need not be 
binding competent for cell staining, detecting or visuaHzing 
appHcations and they are optionaUy labeled using standard 
labels, e.g., radiolabels (^^p, ^% ^^C, ^H), fluorescent 
labels such as fluorescein, Texas Red, rhodamine, BODIPY, 
resonifin or arylsulfonate cyanines and chemiluminescent 
labels, e.g., acridinium esters. 

Embodiments of such optionally labeled highly hpophihc 
invention ohgonucleotides include species where (i) at least 
about 30%, typically at least about 40%, usually at least 
about 60% (often at least about 80%), of the interaucleotide 
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linkages are non-ionic internucleotide linkages (typically methylamine (methylimines) and metfayleneoxy 
containing a lipophilic moiety at each non-ionic lirikage), or (methylimino) (Debart, "Bioorg Med Chem Lett" 2:1479, 

(ii) ai least about 30%, typically at least about 40% usually 1992) and amino (PCT 91/06855). Also of interest are 

at least about 60%, of the bases included in said oligonucle- hydrazino and siloxane (U.S. Pat. No. 5,214,134) linkages, 
otide contain a lipophilic substitution, (iii) at least about ^ thionotriester linkages (WO 96/29337) and related synthesis 

30%, typically at least about 40% usually at least about 60%, methods (WO 97/31009). 

of the sugars, usually at the 2' position, included m said Phosphodiester substitute linkages per se also are known 

oligonucleotide contain a lipophilic substitution or (iv) the for the replacement of the entire phosphoribosyl linkage of 
percent non-ionic nucleotide linkage and the percent hpo- iq conventional oligonucleotides. These include for example 
phiHc bases and the percent lipophilic sugars sum to at least morpholino-carbamates (Stirchak, "NAR" 17:6129, 1989), 
about 30%, typically at least about 40% usually at least peptides (Nielsen et al., "Science" 254:1497, 1991; U.S. Ser. 
about 60% (or at least about 80%). Nos. 07/892,902 and 07/894, 397), riboacetal linkages (PCT 
Usually, the invention base analogs and other noninven- 92/10793) and morpholino-based linkages disclosed in U.S. 
tion bases that are present in binding-competent oligonucle- P^^- Nos. 5,521,063 and 5,185,144. 
otides are linked together by an organic moiety that is Additional disclosure of phosphodiester substitute link- 
sufficiently flexible to permit the invention base analog(s) to ages is found in U.S. Pat. No. 5,386,023, U.S. Pat. No. 
hybridize to complementary bases. The linkage may be a 5,489,677, WO 95/18623, WO 94/00467, WO 93/08296, 
conventional phosphodiester linkage in which a nucleotide 20 WO 92/20822, WO 92/20823, PCT 91/08213, 90/15065, 
analog containing a structure (1) compound, where is 91/15500, 92/20702, 92/20822, 92/20823, 89/12060 and 
deoxyribosyl, ribosyl or an analog thereof, which is incor- 91/03680; Mertes, "J Med Chem" 12:154, 1969; Mungall, 
porated into an oligonucleotide by conventional methods. "JOC" 42:703, 1977; Wang, "Tet Lett" 32:7385, 1991; 
Alternatively, other groups are used to replace the phos- Stirchak, "NAR" 17:6129, 1989; Hewitt, "Nucleosides and 
phodiester linkage or, in some instances, both of the phos- Nucleotides" 11:1661, 1992; Van Aerschot, "Agnew Chem 
phodiester linkage and the sugar group. These replacement Int Ed Engl" 34:1338, 1995; and U.S. Pat. Nos. 5,034,506 
groups are termed "phosphodiester substitute linkages" for and 5,142,047. 

the purposes herein. Invention embodiments include oligonucleotides having 
Phosphodiester substitute linkages are well-known from 30 1, 2, 3 or more optionally protected invention bases, 0 to 

the prior literature. They include for example phospho- about 30 other optionally protected bases, usually 

rodithioates (Marshal, "Science" 259: 1564, 1993), phospho- guanosine, adenine, thymine, cytosine or 5-methylcytosine, 

rothioates aiKl alkylphosphonates (U.S. Pat. No. 5,212,295, and at least one modified Hnkage, e.g., 3' — N(R^^)— O— 5', 

Kibler-Herzog, "Nucleic Acids Research" [hereafter "NAR" where R^^ is hydrogen, C^.^ alkyl, usually — CH3 or — C2H5 

] 19:2979, 1991; PCT 92/01020; EP 288,163; FIG. 12-1), and the terminal atoms are linked to the 3' and 5' carbons of 

phosphoroamidates (Froehler, "NAR" 16:4831, 1988), adjacent ribose or 2'-deoxy-2'-R^^ substituted ribose sugars, 
3'-NH phcsphoramidates (Schultz, "NAR" 24:2966, 1996; 

Gryaznov, "J Am Chem Soc" [hereafter "JACS"] 116:3143, (51) 
1994; Chen, "NAR" 22:2661, 1995; Gryaznov, "Proc Natl 40 
Acad Sci" USA 92:5798, 1995), phosphotriesters (Marcus- 
Sekura, "NAR" 15:5749, 198-^, boranophosphates (Sood, 
"JACS" 112:9000, 1991), 3'-0-5'-S- phosphorothioates 
(Mag, "NAR" 19:1437, 1991), 3'-S-5'-0-phosphorothioates 
(Kyle, Biochemistry 31:3012, 1992), 3'-CH2-5'-0- 

phosphonates (Heinemann, "NAR" 19:427, 1991), 3'-NH- 

5'-0-phosphonates (Mag, "Tet. Lett." 33:7323, 1992), sul- ^ 
fonates and sulfonamides (Reynolds, "JOC 57:2983, 1992), 
sulfones (Huie, "JOC" 57:4519, 1992), sulfoxides (Huang, 50 
"JOC" 56:3869, 1991), sulfides (Schneider, "Tet Lett." 
30:335, 1989), sulfamates, ketals and formacetals 
(Matteucci, "JACS" 113:7767, 1991, PCT 92/03385 and 
PCT 90/06110), 3'-thiofonnacetals (Jones, "JOC 58:2983, 
1993), 5*-S-thioethers (Kawai, "Nucleosides Nucleotides" 
10:1485, 1991), carbonates (Gait, "J Chem Soc Perkin Trans 
1" 1389, 1979), carbamates (Stirchak "JOC" 52:4202, 
1987), faydroxylamines (Vasseur, "JACS" 114:4006, 1992), 
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or both R"^ together are a protecting group, or is alkyl 
usually unbranched or branched once containing 
1, 2, 3, 4, 5, 6, 7 or 8 carbon atoms, including methyl, ethyl, 
n-propyl and isopropyl) or R'^^ is substituted alkyl (Ci-Ci2, 
usually unbranched or branched once containing 1, 2, 3, 4, 
5, 6, 7 or 8 carbon atoms, with substituents including one, 
two or more — O— , —C(0)— , — 0C(0)— , — C(0)0— 
— 0R^^ — SR'^^ — C(0)NR^^— -C(0)N(R^^)2, 
_Nr4i_ — N(R^% halo (e.g, — F, —CI), -~CN, — NO^ 
moieties); R"^^ independently is hydrogen, a protecting 
group (or both R"^^ together are a protecting group), alkyl 
(C1-C4 including methyl, ethyl and n-propyl); R'^^ is hydro- 
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gen or a protecting group; R'^^ is C^.g alkyl or a protecting 
group; and R"^^ is — ^H, a counter ion or a hydiolyzable 
moiety such as 
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where R^® independently is O, CH^ or NH; R"^ indepen- 
dently is hydrogen, C^j, alkyl (methyl, ethyl, n-piopyl, 
i-propyl, n-butyl, i-butyl, t-butyl, etc.), a protecting group or 
both R'"' together witfi tiie nitrogen atom to which they are 
attached form 



R"^^ is alkyl containing 1, 2, 3, 4, 5, 6, 7 or 8 carbon atoms. 
R'*^ pairs include ones where one R'*^ is hydrogen and the 
other R'^^ is alkyl containing 1, 2, 3, 4, 5 or 6 carbon atoms 
or substituted alkyl containing 1, 2, 3, 4, 5, 6, 7 or 8 carbon 
atoms, including methyl, ethyl, methoxyethyl and ethoxy- 
ethyl. When R'^is substituted alkyl, it will usually contain 
1, 2, 3 or 4 non-carbon atoms, but may contain additional 
non-carbon atoms, particularly when the non-carbon atoms 
are halogens or when a group is present as a protecting 
group, e.g., R^^, R'"', R^^ or R^^ 

Structure (2) and (52) oligonucleotides include species 
where one or more R^^ is — F, — 0(CH2)2NHR^, 
— 0(CH2)3NHR^ — 0(CH2)4NHR^ —OiQ^^^OC^^, 
--0(CH2)30CH3, ^(CH2)20R', — 0(CH2)2F, — ©(CHa) 
30R^ or — 0(CH2)3F (see, e.g., Griffey "J Med Chem" 
39:5100-5109 1996, Schultze "CeU" 24:2966-2973 1996). 
Such oligonucleotides include oHgonucleotides where 1, 2, 
3, 4, 5, 6, 7, 8 or more monomers are substituted with R^^, 
which will optionally comprise one of ihese substituents and 
the remaining R^^ are all hydrogea Embodiments also 
30 include optionally protected monomers containing an 
optionally protected invention base for synthesis of 
phosphoramidate-linked oligonucleotides. Oligonucleotides 
containing one or more of these linkages are optionally 
prepared as highly lipophilic oligonucleotides and they are 
suitable for cell staining uses, diagnostic uses and for 
antisense applications that optionally rely at least in part on 
an RNase H mechanism. 

Invention embodiments include oligonucleotides or 
monomers described in U.S. Pat. Nos. 5,670,489, 5,667,976, 
5,652,355, 5,6.52^56 and 5,212,295 where one or more 
optionally protected invention bases is present, usually 1, 2, 
3 or 4. 

Invention embodiments include oligonucleotides having 
1, 2, 3 or more optionally protected invention bases, 0 to 
about 30 other bases (optionally protected) and at least one 
amide linkage, e.g., a compound of structure (2B) where n 
is 0 to about 50, usually about 5-21. Such amide linkages 
have been described, e.g., (Haaima "Agnew Chem Int lid 
Engl" 35:1939-1942 1996; Nielsen "Bioconjugate Chem" 
5:3-7 1994). Other amide linkages that are suitable have 
been described, e.g., WO 92/20702 and WO 93/24507. 
Embodiments also include optionally protected monomers 
containing an optionally protected invention base for syn- 
thesis of amide-linked oligonucleotides. In general, structure 
(2B) oligonucleotides will contain only amide linkages, but 
they may also comprise a domain of monomers linked by 
non-amide linkages. Suitable and have been 
described, e.g., WO 86/05518, WO 92/20702, WO 
93/24507. optionally comprises a peptide coupling 
group, a protecting group, or a solid support. optionally 
comprises — H, a peptide coupling group, a protecting 
group, or a solid support, but and are not both a 
peptide coupling group or a solid support, WO 92/20702 
described activated derivatives of — COjH and — SO3H. 
65 The phosphodiester or phosphodiester substitute linkages 
herein are used to bond die 2' or 3* carbon atoms of ribose 
or ribose analogs to the 5' carbon atoms of the adjacent 
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ribose or ribose analog. Ordinarily, the linkages in oligo- Embodiments of the oligonucleotides of the invention 
nucleotides are used to bond the 3' atom of the 5' terminal comprise a moiety which is capable of effecting at least one 
oligonucleotide to the 5' carbon atom of the next 3'-adjaccnt covalcnt bond between the oligonucleotide and a nucleic 
nucleotide or its analog. In general, linkages that contain a add duplex or strand. Multiple covalent bonds can also be 
phosphorus atom will be 3',5' linkages and not 2\9 linkages 5 formed by providing a multiplicity of such crossliuMng 
' because such linkages usually confer reduced binding affin- moieties. The covalent bond is preferably to a base residue 
ity on the oligonucleotide in which they are present. in the target strand, but can also be made with other portions 
Table 1 of U.S. Pat. No. 5,502,177 describes examples of of the target, including the saccharide or phosphodiester. 
suitable phosphodiester substitute linkages for use with the Preferred crosslinking moieties include acylating and alky- 
invention base analogs, llie starting materials in Table 1, or lo iating agents, and, in particular, those positioned relative to 
those used to prepare the starting materials of Table 1, the sequence specificity-conferring portion so as to permit 
generally possess structure (1) in which is ribose, reaction with the target location in the strand. Exemplary 
2'-deoxyribose, a ribose analog or a 2'-deoxyribose analog crosslinking moieties are disclosed and claimed in PCT 
comprising a 5' hydroxyl group and a 3' or 2' hydroxyl 91/03680. See also Praseuth ("Proc Natl Acad Sci" 8:1349, 
group, prepared as described herein or in the citations, with 15 1988), Fedorova ("FEBS" 228:273, 1988), Meyer ("JACS" 
an invention base analog(s) being substituted for the bases 111:8517, 1989), Lee ("Biochemistry" 27:3197, 1988), 
used in the citations. The reactions are repeated or ganged Home ("JACS" 112:2435, 1990), Shaw ("JACS" 113:7765, 
with phosphodiester or other Linkages in order to produce 1991). 

trimers, tetramers, pentamers or larger oligonucleotides, Invention embodiments include monomers and oligo- 

including ones up to about 100 bases. 20 nucleotides containing 1, 2, 3 or more invention bases and 

The oligonucleotides of this invention contain naturally a 5' hydroxyl group protected with a base labile protecting 

occurring nucleotides or derivatives thereof. In some oligo- group, including a dansylethoxycarbonyl group, which has 

nucleotide embodiments the companion nucleotide residues been described, e.g., U.S. Pat. No. 5,631,362. Such embodi- 

contain pyrimidine nucleotides substituted at the 5 position ments optionally include additional protecting groups, 

with a carbon atom which is distally Pi bonded to another 25 Invention embodiments include monomers and oligo- 

atom as for instance 1-alkenyi, 1-alkynyl, heteroaromatic nucleotides containing 1, 2, 3 or more invention bases and 

and l-alkynyl-heteroaromatic groups such as 5-(l- an invention base or a non-invention base having an exo- 

propynyl)-cytosine and -uridine nucleotides (see PCT Pub- cyclic nitrogen atom, where the nitrogen atom is protected 

lication No. WO 93/10820 and U.S. Pat. No. 5,594,121). with a protecting group as previously described, e.g., speci- 

Other analogs of native bases for use herein include alky- 30 fication and claims 1, 2, 3, 4, 5, 6, 7 and 8 of U.S. Pat. No, 

lated purines or pyrimidines, acylated purines or 5,623,068. Such embodiments optionally include additional 

pyrimidines, or other analogs of purine or pyrimidine bases protecting groups. 

and their aza and deaza analogs. These include, for example Invention embodiments include optionally protected 

N'^,N'^-ethanocytosine, 7-dea2axanthosine, monomers and optionally protected oligonucleotides con- 

7-deazaguanosine, 8-oxo-N^-methyladenine, 35 taining 1, 2, 3 or more invention bases wherein the compo- 

4- acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, sitions possess N-branching, which has been described, e.g., 

5- fluorouracil, 5 -bromouracil, U.S. Pat. No. 5,623,049. 

5-carboxymethylaminomethyl-2-thiouracil, Invention embodiments include "hybrid" 

5-carboxymethylaminomethyl uracil, inosine, oligonucleotides, which contain 1, 2, 3 or more invention 

N^-isopentenyl-adenine, 1-methyladenine, 40 bases and 2' modifications in one or two regions or domains 

2-methylguanine, 5-methylcytosine, N^-methyladenine, that comprise adjacent linked monomers, typically about 

7-methylguanine, 5-methylaminomethyl uracil, 5-methoxy 2-^8 linked monomers, usually about 2-3, One domain 

aminomethyl-2-thiom:acil, 5-methoxyuracil, pseudouracil, contains 2' modifications, while hydrogen is linked to the 

5- methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-(l- remaining monomers, typically about 4-10 adjacent linked 
propynyl)-4-thiouracil, 5-(l-propynyl)-2-thiouracil, 5-(l- 45 monomers, usually about 6-8. Such oligonucleotides con- 
propynyl)-2-thiocytosine, 2-thiothymidine, and 2,6- tain at least one domain that is competent to serve as a 
diaminopurine. In addition to these base analogs, one can RNase H substrate and comprises hydrogen at each 2' 
conveniently incorporate into the invention oligonucleotides position and phosphodiester, phosphorothioate or phospho- 
other base analogs, including pyrimidine analogs including rodithioate 3',5' linkages. The other domain(s) contain a 2* 

6- azacytosine, 6-azathymidine, 5-trifluoromethyluracil or 50 modification(s), such as — O — (€112)2^ or — O — (CU^)^^ — 
other bases previously described, see, e.g., bases, monomers O — CH3, that enhances binding affinity or nuclease stability, 
or oligonucleotides described in WO 92/02258, WO The 2'-modified domain(s) is usuaUy not an efficient RNase 
97/32888 and U.S. Pat. No. 5,614,617. H substrate. The bases in hybrid oligonucleotides are the 

Preferred bases include adenine, guanine, thymine, uracil, typical purines and pyrimidines found in nucleic acids (G, A, 

cytosine, 5-methylcytosine, 5-(l-propynyl)uracil, 5-(l- 55 T, C or U) or their analogs, which one finds in some 

prDpynyl)cytosine and 5<l-butynyl)uracil, 5-(l-butynyl) oligonucleotide analogs (e.g., U.S. Pat. Nos. 5,484,908, 

cytosine. 5,594,121 and 5,502,177, International PubHcation No. WO 

Invention embodiments include the protected derivatives 93/10820). Intermediates used to prepare hybrid oligonucle- 

of native bases, their analogs and optionally protected otides will typically contain appropriately protected deriva- 

monomer synthons containing such bases, which one would 60 tives of any bases. 

typically use as intermediates to prepare invention oligo- Oligonucleotides of inverted polarity also fall within the 

nucleotides (see, e.g., International Publication No. WO scope of this invention. "Inverted polarity" means that the 

96/37504, U.S. Pat. No. 5,614,622 Iyer et al., "Nucleosides oligonucleotide contains tandem sequences which have 

& Nucleotides", 14: 1349-57, 1995, Uhlmann et al., "Chem opposite polarity, i.e., one having polarity 5'->3' followed by 

Revs", 90:543-587, 1990, S. Agrawal, Ed. Methods in 65 another with polarity 3'^5', or vice versa. These sequences 

Molecular Biology^ \bl 20, Protocols for Oligonucleotides thus are joined by linkages which can be thought of as 

and Analogs, pp. 165-189, Humana Press, 1993) effectively a 3'— 3' internucleoside junction (however the 
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linkage is accomplished), or effectively a 5' — 5^ intemucleo- 
side junction. For a further description of suitable methods 
for making such oligonucleotides sec, e.g., WO 93/10820. 
Compositions of "parallel-stranded DNA" designed to form 
hairpins secured with AT linkages using either a 3'-3' inver- 5 
sion or a 5-5* inversion have been synthesized by Van de 
Sande, "Science" 241:551, 1988. In addition, oligonucle- 
otides which contain 3' -3' linkages have been described 
(Home, op cit; and Froehler, "Biochemistry" 31:1603, 
1992). These oligonucleotides are useful as binding partners lO 
for double stranded nucleic acids to form triple helix (or 
triplex) complexes as a means for detecting complementary 
sequences and inhibiting of target gene expression (PCX 
89/05769 and 91/09321). 

Invention embodiments include polymers such as oligo- 15 
nucleotides containing 1, 2, 3 or more invention bases, 
where the polymer is a component of a complex or compo- 
sition useful for Iransfecling the polymer into a cell in vitro 
or in vivo. These complexes or compositions are referred to 
herein as "transfection complexes". Such transfection com- 20 
piexes optionally comprise one or more lipids, e.g., cationic 
or anionic lipids, as well as other lipophilic compounds such 
as cholesterol or colipids such as DOPE. The complexes 
optionally comprise an uncharged or a charged polymer such 
as polyetiiylene glycol, polybrene or a peptide, e.g., polyl- 25 
ysine. The complexes optionally comprise unilamellar or 
multilamellar liposomes or vesicles. 

As used herein, any compound(s), reagent(s) or treatment 
that enhances deHvery of an invention oligonucleotide into 
a ceU or tissue is a "permeation enhancing agent." Perme- 30 
ation enhancing agents are well known and are usually 
present as transfection complexes containing 
oligonucleotides, e.g., unilamellar or multilamellar lipo- 
somes or vesicles. One uses permeation enhancing agents to 
prepare transfection complexes containing invention oligo- 35 
nucleotides. The permeation enhancing agent are used in 
essentially the same manner as is used to prepared transfec- 
tion complexes containing nucleic acids, non-invention oli- 
gonucleotides or polymers into cells or tissues. 

Invention transfection complexes optionally comprise an 4n 
additional non-invention polymer(s), e.g., a nucleic acid 
expression vector(s), a therapeutic agent(s) (e.g., amphot- 
ericin B) or a peptide(s). 

Invention transfection complexes comprising a lipid may 
be, as defined herein, *'large", i.e., complexes having a 45 
maximum averse dimension of at least about 200 nm in 
length or diameter, typically having an average length or 
diameter of about 200-400 nm, occasionally having an 
average length or diameter of about 400-800 nm. Transfec- 
tion complexes comprising a hpid may be "small", i.e., 50 
complexes haviug a maximum average dimension of about 
15-200 nm in length or diameter, e.g., an average dimension 
of about 60-120 nm. Transfection complexes may comprise 
a mixture of lai^e and small complexes in about equal 
proportions or they may comprise a preponderance of small 55 
or large transfection complexes, e.g., at least about 55%, or 
at least about 60-80% of the complexes in a given prepa- 
ration are large or small. 

Transfection complexes comprising a lipid optionally 
include a stabilizing compound(s), e.g., a monosaccharide or 60 
a disaccharide such as glucose, trehalose, maltose or 
sucrose, that is present at the outer surface or at the inner 
surface or at both surfaces of transfection complexes. Work- 
ers have described suitable compounds such as lipids, colip- 
ids and stabilizing compounds for making transfection 65 
complexes, methods to size the complexes and methods to 
use the complexes to deliver a polymer or monomer into the 



cytoplasm of a cell in vitro or in vivo, e.g., U.S. Pat. Nos. 
5,635,491, 5,633,156, 5,631,018, 5,629,184, 5,627,159, 
5,626,867, 5,620,689, 5,595,756, 5,543,152, 5,478,860, 
5,459,127, 5,264,618, 5,223,263, 5,194,654, 4,981,692, 
5,077,056, 4,522,803, 4,588,578, 4,885,172, 4,975,282, 
5,059,421, 5,000,958, 5,030,453 and 5,047,245, WO 
96/01840, WO 96/01841, WO 97/30969, WO 97/30732, 
Lewis, "Proc Nati Acad Sci" 93:3176-3181, 1996, U.S. 
patent appUcation Ser. No. 08/672,206. 

Invention transfection complexes useful for dehvering the 
invention oligonucleotides Lato cell cytoplasm also include 
complexes comprising inorganic compounds, e.g., calcium 
phosphate. 

The R^^ moiety is hnked to invention oligpnucleotidcs or 
monomers useful for oligonucleotide synthesis. R^^ is usu- 
ally linked to the 2' or the 3' position of foranose sugars. 
When R"^ is a nuclease stability enhancing moiety, a broad 
range of structures may be used to increase stability of 
oligodeoxynucleotides or oligoribonucleotides containing 
phosphodiester linkages. Oligonucleotides having moieties 
other than hydrogen or hydroxyl at the 2' position usually 
confer increased nuclease stability or increased binding 
affinity on the oligonucleotide relative to hydrogen or 
hydroxyl. Enhanced nuclease stability is conveniently mea- 
sured using dimers or short oligonudeotides as essentially 
described, e.g., WO 92/05186. One or two R^^ moieties at 
the 3' and 5' terminal monomers in an oligonucleotide will 
increase stability of the oligonucleotide to 3'- and 
5'-exonucleases. One may increase an oligonucleotide's 
stabihty to endonucleases by incorporating R^^ moieties that 
increase nuclease stability at internal monomer positions. 

In addition to increasing nuclease stability, some R^^ 
moieties enhance binding affinity of the oligonucleotide to 
which they are linked. Hiese moieties include fluorine and 
short unbranched optionally substituted O-alkyl groups con- 
taining about 2-8 carbon atoms, where the alkyl group is 
optionally substituted at the distal carbon atom with, e.g. 
— ^F, — OH or — NH2, and optionally substituted with an 
ether at an internal carbon, e.g., 
— O— (CH2)2— O— (CH2)2F, — 0~(CH2)20CH3 or 
---0-(CH2)2— 0--(CH2)2CH3. 

When R^^ is a nuclease stability enhancing moiety, it will 
typically comprise — CH3, =0 — ^NHR^ or a chain having 
a backbone containing 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 or 12 
linked atoms, wherein the chain usually comprises carbon 
(C) atoms and optionally 1, 2, 3 or 4 atoms independentiy 
selected from the group consisting of oxygen (0), nitrogen 
(N) and sulfur (S) atoms. The chain is usually linked to the 
sugar carbon atom through — O — , — S — , — S(0) — , S(0) 
(0)— , — , =CH— or ^NH— . The chain is 
branched or unbranched, often it is unbranched or has only 
limited branching, e.g., — CH3, — CHjOH, — C2H5 or 
— C2H4OH. The R^^ chain may comprise a C^i^ a^cyl 
group or a C2.20 substituted alkyl group. If R^^ is a substi- 
tuted alkyl group, usually only 1, 2 or 3 carbon atoms are 
substituted. Suitable substituents include those described 
above for substituted alkyl groups, e.g., halogen (usually 
fluorine), — O — or — OR^. 

Invention embodiments include oligonucleotides and 
monomers where one or more monomers comprise 
2 -deoxyribose, ribose or arabinose sugars, or their carbocy- 
clic analogs, having one or more 2* R^^ modification such as, 
— O— alkyl, — NH—alkyl, ^—alkyl, — OR^ — NHR^ 
— SR^ -halo (usually — F), — R-^^-alkyl or — R44- 
substituted alkyl wherein the aikyl or substituted alkyl group 
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usually comprises 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 or 12 carbon the hydroxyl, amino and any other labile groups of are 
atoms, usually about 2-6 carbon atoms, where R"^"^ is inde- protected as required. In another approach, R^ of the starting 
pcndcntly — O — , — S — , — ^NH — or — CH2 — ^ usually material is H or a protecting group and one adds the linker 
— O — , The oligonucleotide linkages connecting such after the ring closure steps set forth in the schemes^ in the 
monomers are 3',5' linkages. The alkyl groups at the 2' 5 same fashion as has heretofore been employed in the alky- 
position typically have 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 or 12 ^^^on of pyrimidine bases intended for use as antiviral 
carbon atoms which are optionally present as methylene compounds. 

groups ( — CH2 — ) and optionally have 1, 2, 3 or 4 ether ^° lhos& embodiments in which R^ is a binding partner 

(—0—) or other substitutions, e.g., 0-aikoxyalkyl(C2-Ci2 ^^^^ ^ polymer the compounds of this invention are 

alkyl), —O— (CH2)2 ^—CH^COsH, — O— (CH2)2 s— 10 synthesized by covalently crosslinking the linker modified 

CH2N(R^)2, — (— 0---(CH2)2.4— O— (CH2)2 4—0— " polycyclic base of this invention to the binding partner, or 

(CH2)2-4— R^^), — 0(CH2CH20) CH2— R^^ bmdmg partner is a polymer) by incorporating 

O CH CH R^^ 0(CH CH )OCCH CH )R^^' polymer a monomer unit which is substituted by an 

-OCH,cicFUher;R^^is -^H,halo(^^^^^^^ invention polycyclic base. 

-OR^ -OCH3, -NHR^ ^R^ and r is 1, 2, 3 or 4, 15 J"" ^'f embodiment (polymer grafting) a 

usually 1 or 2 -substituted polycyclic substructure is covalently bonded 

„ 1 * . .1. . . 1 . . , , . , via any conventional cross-linking agent to the polymer. 

The alkyl groups at the 2 position also mclude substituted Most conveniently, structure (1) compounds in which R^ is 

aDcyl, e.g., —0— alkylammo, — S-alkyiammo, — NH- hydroxy^ or amino-substituted alkyl are readily cross- 

alkylammo, or their protected derivatives, wherein the alkyl Hnked to reactive groups present in the molecule to be 

group contains 2, 3, 4, 5, 6, 1, 8, 9, 10, 11, or 12 carbon 20 labeled as noted above. Exemplary cross-linking agents 

atoms, which are all optionally present as methylene carbons include succinic anhydride, N-hydroxysuccinimide esters 

(— CH2— ). Usually the allqrl group or substituted alkyl (biotin NHS ester), epoxides, isothiocyanates, imidates, 

group at R^^ will contain 1, 2, 3, 4, 5, 6, 7 or 8 carbon atoms. DCC (dicclohexylcarbodiimide), EDC (l-ethyl-3-[3- 

Such groups include — O-methyl, — 0-ethyl, — O-n-propyl, (dimethylamino)propyl] carbodiimidc), BOP, and 

—O-aUyl, — 0^CH2)2-60H, including — O— {CH2)20H, 25 glutaraldehyde, see, e.g., HP 0 063 879, Ruth "J Org Chem" 

— O— {CH2)2F, — O—CH2CHF2, — O— CH2CF3, — O— 43:2870, 1978, Bergstrom, JACS 100:8106, 1978, Bigge, 

(CH2)2.60CH3, including — O— (CH2)oOCH3, —0— 102:2033, 1980. Cyanogen bromide activated carbo- 

(CH2)20CH2CH3, — O— (CH2)20ChCcH20H, —0— hydrates also are used. The cross-linking agents are used to 

(CH2)20CH2CH2F, — O— (CH2)2NHR^ -substituted polycycle to the polymer in the 

— O (CH2)3NHR^, C) (CH2)4NHR^, O (CH ) f' 30 fashion as polymers heretofore have been cross-linked 

— 0 (CH2) F, — O (CH ) F,"^ CF-,CF ' ligands, e.g., to hydroxyl or amino-bearing moieties. An 

Q /'pi/ \ D 65 ^pTT \ rr^ /pxT \ el' example of a suitable method is described per se in Cook et 

-NH-merhyl, ' -NH -etfy 1? iNH-n prolyl; ^ ^' -^^^^^ ^^^^^y ^PP^^^ 

-NH-(CH.).OH, ~NH-(CH2)30H, ^NH-^ChXf, covalently bond an ammo-substituted R^ Hnker to the 5' 

-NH-CH2^CF2CF3, -NH-(CH2)^^^ -J-methyl, 35 teimmus of an oligonucleotide. 

— S-ethyl,-^-n-propyl,~^-allyl,— ^-(CH2)pH,-^— ^^^^ R or R are ammo substituted, the foUowing 

(CH2)30H —S (CH2)2F ^ and exemplary synthetic approaches are suitable for crosslinking 

-S-<CH2),-40->{CH.).]^^^ where s is 2, 3, 4, 5, 6, 7 ^"^^^ moieties: 

or 8. R^^ moieties do not^ include unstable species at the 2' — CH2NH2+R'°— C(N-^H2)— 0R^°-^-^CH2NHC(N-^ 

position, e.g., — O — 0 — or — S — O — . 40 H2) — R 

Other suitable R^^ at the 2 or 3' position of optionally ~CH2NH2+R'''— N=C=S^--CH2NHC(S)NH— R^° 

protected invention monomers or optionally protected — CH^NH^+R^^-epoxide^ — CH2NHCH2CH(0H) — 
invention oligonucleotides include — ^H, — ^H, — NHOR^^, 

=NH, — N3, CN, — CH2CN, — CHCI2, — CFH2, is ^ organic moiety optionally containing a 
— CTall, =Cn2, =CF2, — <:n2CII=CIl2, =0, =CIIC hapten or other detectable moiety such as biotin or avadin. 
(O)0R^^ (including =CHC(0)0CH3 and =CHC(0) 1° second embodiment (copolymerization) the R^ 
OCH2CH3), — 0C(S)0Ce,H5, t-butyldimethylsilyl ether, tri- ^^^^ is capable of functioning as a monomer for copoly- 
isopropylsilyl ether, a 2 amino group protected by an menzation with other monomer units that may or may not be 
N-phthaloyl protecting group or a fluorescent label, R^^ is substituted with the polycyclic substructure of structure (1). 
independently R^, alkyl having 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10 In some embodiments, the R^ linker is an alkyl carboxylate, 
carbon atoms or substituted alkyl having 1, 2, 3, 4, 5, 6, 7, amine or an amino acid for incorporation into 
8, 9 or 10 carbon atoms where the carijon atoms in R^^ are peptides by in vitro methods. However, in the typical 
optionally all present as methylene ( — CH2— ) or substituted embodiment the R^ polymeric binding partner is an oligo- 
methylene (— CH(substitution)-) moieties. Workers have nucleotide as depicted in structure (2), and these conve- 
described suitable 2' modified monomers and niently are made by copolymerization with a nucleotide 
oligonucleotides, e.g., WO 97/14706, WO 96/05298, WO analog substituted with the polycyclic substructure. The 
93/13121, and WO 91/06556 and U. S. Pat. Nos. 5,631,360, starting materials for the synthesis of structure (2) generally 
5,627,053, 5,623,065, 5,576,302 and 5,578,718. are compounds of structure (1) in which R^ is ribose or 
Methods for Svnthesis deoxyribose substituted with appropriate protecting and 
^ 60 coupling groups further described above. Suitable starting 
The compounds of structure (1) where R^ is a linker or H monomers for oligonucleotides having phosphodiester sub- 
are prepared by methods known in the art per se and as more stitute linkages are set forth in Table 1 , and they are prepared 
fully described below. Typically, such compounds are pre- in the same fashion as other nucleotide analog bases 
pared from a 5-bromouracil, 5-bromouridin-l-yl, described in the literature. Similarly, conventional phos- 
5-iodouracil, 5-iodouridin-l-yl substituted derivative as 65 phodiester or phosphorothioate linkages are prepared from 
shown in the synthetic schemes below and subsequent nucleotide anal<^ containing coupling groups D and 
reactions close the polycyclic ring. In these embodiments described above. The compounds of this invention then are 
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incorporated into the desired oligonucleotide by known 
methods of in vitro synthesis described in the referenced 
methods. Alternatively, polycylic substructure-substituted 
nucleotide triphosphates may be incorporated into oligo- 
nucleotides as cytosine analogs by DNA polymerase or 5 
reverse transcriptase in vivo or in vitro (see Ward, U.S. Pat. 
No. 4,711,955). In this case, is ribosyl or deoxribosyl 
triphosphate, or a triphosphorylated analog thereof recog- 
nized by DNA polymerase or reverse transcriptase which is 
then incorporated into an oligonucleotide by template- lo 
directed transcription. 

Synthesis of oligonucleotides containing 3 or more nucle- 
otide residues is optionally accomplished using synthons 
such as dimers (which contain substitute or diester liiokages) 
or trimers, each carrying a terminal coupling group suitable 15 
for use with amidite, H-phosphonate or triester chemistries. 
The synthon is then linked to the oligonucleotide or another 
synthon via a phosphodiester or phosphorous-containing 
phosphodiester substitute linkage. 

Oligonucleotides containing phosphorothioate, meth- 20 
ylphosphoaate and phosphodiester linkages are readily pre- 
pared by solid-phase oligonucleotide synthesis techniques. A 
description of modiQcations useful in the synthesis of phos- 
phorothioate linked oligonucleotides are found, for example. 



in EP 288,163, wherein the oxidation step in solid phase 
automated synthesis using amidite chemistry can be inde- 
pendently adjusted at any step to obtain the phosphorothio- 
ate. An alternate method for synthesis of oligonucleotides 
with phosphorothioate linkages, via hydrogen phosphonate 
chemistry, has also been described (Froehler "NAR" 
14:5399, 1986), Sulfurization is accomplished using 
reagents such as tetraethylthiuram disulfide, dibenzoyl 
tetrasulfide, thiophosphoric acid disulfide, 3H-1,2- 
benzodithiol-3-one 1,1 -dioxide and the like as described 
(Vu, "Tet Lett" 26:3005, 1991; Rao, "Tet Lett" 33:4839, 
1992; U.S. Pat. No. 5,151,510; Iyer, "JOC" 55:4693, 1990; 
Dahl, "Sulfur Reports" 11:167, 1991). These sulfurization 
reagents are used with either pfaosphoramidite or hydrogen- 
phosphonate chemistries. Synthesis of phosphorothioate oli- 
gonucleotides having controlled stereochemistry is used to 
generate stereoregular invention oligonucleotides as 
described (EP 506,242). Thionomethyl phosphonate is pre- 
pared with methylphosphonamidite followed by sulfuriza- 
tion as described (Roelen, "Tet Lett" 33:2357, 1992) or with 
the sulfurization reagents described above. 

One prepares various structure (1) compounds as 
described below and in the examples. 




(100) 



Scheme A 



Pli3P,Ca4 CH2a2, 



^ep 1 




(101) 




step 2 



DBU 
+ 



Hr47^ 

(102) 



steps 

PhsP, DEAD 
CH2a2,R2AoH 
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(105) 



fit 



Scheme A depicts preparation of structure (1) coropounds 
where R^^ is —0— or and R^— OH is R"^ whidi 

has a free hydroxyl. 

Step one is conducted by heating the reaction mixture 
containing (100) in an organic solvent to at least about SO"* 
C, generally for about 3-4 hours. Step two is performed by 20 
reacting (102) in an organic solvent for about 6-48 hours, 
generally for about 10-20 hours at about 15*" C. to reflux 
temperature, generally at about 18-25° C. The moiety is 
linked under Mitsunobu conditions to (103) in step 3 by 
reacting about 1-1.5 equivalents of the alcohol, i.e., R^ — 
OH, using an activating agent as a leaving group, such as 25 
triphenylphosphine (PhgP) and diethyl diazocarboxylate 
(DEAD) to obtain (104). In step 4, (105) is prepared by 
forming the ring containing R'^'^ by (1) incubating (104) in 
a polar organic solvent, typically an alkanol containing 1, 2, 
3, 4, 5 or 6 carbon atoms, such as methanol or ethanol, 
containing a mild base such as NH3, TEA (triethylamine), 
DBU (l,8-diazabicyclo[5.4.0]undec-7-ene) or (2) refluxing 
in ethanol in the presence of potassium fluoride. GeneraUy 
(104) is incubated in saturated NH3 in methanol for about 
2-3 days to afford (105) 



(130) 



The use of (102) in which one R 



30 



-and the other 
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is — S — will produce a mixture. One optionally isolates 
each (104) component or one optionally converts the (104) 
mixture to a (105) mixture. One optionally separates the 
mixtures at any convenient point by standard methods, e.g., 
silica gel chromalogTdphy, or HPLC. 

When one prepares compounds according to scheme A 
and R^ is a monosaccharide, e.g., 2*-deoxyribose, 2'-deoxy- 
2'-R^^-substituted ribose or arabinose, the sugar hydroxyls 
in (100) are usually protected, generally using base-labile 
protecting groups, e.g., acetate, proprionate, butyrate, phe- 
noxy acetyl. The step 4 reaction under basic conditions 
removes the protecting groups, which facilitates the ring 
formation reaction resulting in (105). When R^ is not a 
monosaccharide, it is generally a protecting group or an 
optionally protected h'nker, e.g., — (TV^)j^R'^% where R"^^ 
is independently — CHj — , — O — , — S— , — ^NH — or 
—C(0}—, X is 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10 and R^^ usuaUy 
is a functional group suitable for linking to a solid support, 
monomer or polymer, including — OR , — SR^, — N(R\, 
— C(0)N(R^)2, — NR^C(0)H, — C(0)H and — C(0)OR^, 
R^ is independently — H, a protecting group, or both R^ 
together are a protecting group. Generally the R"^^ group that 
is adjacent to R"^^ is ethylene ( — CHj — CH2 — ) and gener- 
aUy only 0, 1 or 2 R"^ are moieties other than— CH^— , e.g., 
one R"^^ is — O — or — C(0) — and the remaining R'^'^ are all 
— CH2 — . Exemplary compounds where R^ is a monosac- 
charide have the structures (130)-(134), which correspond 
to compounds (100)-(101) and (103)-(105) respectively. 
For compounds (130)h(132), R^ at sugar hydroxyls is typi- 
cally a base labile protecting group such as acetyl and R^'^ 
is typically — O — . During conversion of (133) to (134), the 65 

protecting group is removed from the sugar under ring 
closure reaction conditions, which facilitates ring closure. 



55 




R^O r21 



(131) 




(132) 



HN R'^'^H 




R^O 



r2A ^r47 ^27 



(133) 




RSQ r2^ 
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Structure (134) compounds may be converted to mono- 20 
mers suitable for oligonucleotide synthesis. Such monomers 
typically have a coupling group at the 3* position, e.g., 
H-phosphonate, or a phosphoramidite such as a 
p-cyanoethylphosphoramidite, N,N-diisopropylamino- p 
cyanoethoxyphosphine or N,N 

diisopropylaminomethoxyphosphine. The 5' position will 
contain a DMT-O — or other protecting group suitable for 
oligonucleotide synthesis. The monomers may alternatively 
have a coupling group at the 5' position and a protecting 30 
group at the 3* position. The protecting and coupling groups 
are added sequentially. Scheme B shows synthesis of struc- 
ture (1) compounds where in is — CHj — . In scheme 
B, Y is 1, 2, 3 or 4; is independently — CH^— 
— C(0)— — (CH.),— 0-<CH2)2— ^CH2)2~NR*'— 
(CH,)^-, ^CH,),-~S~-(CH,)2-, -CH(N(R^),)- 
-Ol(COOR^)— or — C(CH3)— , -4::(C^U^}-^ but adja- 
cent moieties are not C(0), usually R^° is —CH^—; TFA is 
trifluoroacetate; and CBZ is carboxybenzoyl; and (96) is 
HteC(R^°)3,— NH-TFA. Protecting groups present in R^° 
are stable to the reaction conditions shown. 



Scheme B 



35 




(106) 



TEA 



65 
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(107) 



10 (107) i^ki^l^ (108) 



2. NH40H(conc.) 
dioxane (1:1) 



'pyridine 
C6H5CH20C(0)a 



H2CH2(R^NHCBZ 



< 



(109) 




CHaCHaCR^V^CBZ 



(110) 




CBZNH(R50)yCH2CH2 




(112) 



NH3/CEI3OH 



CBZNH(R5VcH5CH2v^ .R2^ 
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Compound (97) is converted to (98) by hydrogenation 
reaction in alcohol, usually methanol or ethanol, at about 1^ 
15-25*' C. for about 10-24 hours, usually about 12-18 
hours. The catalyst is removed and the filtrate is concen- 
trated. 

Compound (98) in trimethyl orthoformate is converted to 
(99) in the presence of acid, e.g., methanesulfonic acid at 20 
about 15-25° C, usually about 18-22*" C. for about 20-120 
minutes. The reaction is cooled and quenched with a base, 
e.g., an organic base such as TEA. The reaction mixture is 
concentrated and purified, e.g., by flash column chromatog- 
raphy on silica gel. 25 

Compound (106) is prepared by reacting (99) in organic 
solution such as DMF, CH^Cl^ or CH^Cl^/DMF (about 2:1 
v/v) with K2CO3 at about 15-25** C. for about 30 nainntes, 
followed by adding phenyltrifluoromethanesulfonimide and 
stirring the mixture for about lQ-24 hours, usually about 
12-16 hours. The reaction mixture is then diluted with 
CH2Cl2> washed with water once or twice and concentrated 
and purified. 

Compound (96) is prepared by reacting 
Hte— C(R^^yNH2 with ethyl trifluoroacetate at about 
15-25® C. for about 10-24 hours, washing with saturated 35 
aqueous NaHCOg and concentrated. Compound (96) is 
purified by distillation. 

Compound (107) is prepared by stirring an organic sol- 
vent such as DMF containing about 2 equivalents of (96), 
about 2 equivalents of an organic base such as TEA, and 40 
Pd((PPh)3)4, Cul and about 1 equivalent of (106) at about 
15-25° C. for about 18-36 hours. The organic phase is 
washed with water, dried and purified by silica gel 
chromatography, to obtain (107). 

Compound (107) is hydrogenated in ethanol in the pres- 45 
ence of 10% Pd/C at room temperature. The catalyst is 
filtered off, the filtrate is concentrated and then treated with 
cone. NH40H:dioxane (1:1) to attbrd (108). The amino 
group in (108) was protected with CBZ to afford (109). 

Compound (110) is prepared by treating (109) in ethanol 50 
with aqueous HCl (about 3 N) at about 15-45° C, usually 
about 40° C, for about 30-120 minutes, usually about 60 
minutes. The product is dried and optionally azeotroped 
using e.g., CH3CN, several times. 

Compound (111) is prepared by reaction of (100) with 55 
mesitylenesulfonyl chloride in the presence of a tertiary 
amine such as TEA. 

Compound (112) is prepared by stirring a mixture of (110) 
and (111) in organic solution containing about 2 equivalents 
of an organic base such as DBU or TEA at about 15-25° C. 60 
for about 10-24 hours. The reaction mixtiue is washed with 
an aqueous 10% citric acid solution, dried and purified by 
silica gel chromatography. 

Compound (113) is prepared by treating (112) with satu- 
rated NH3 in methanol at about 15-25° C. for about 3-4 65 
days. The reaction mixture is dried, concentrated and puri- 
fied by silica gel chromatography. 



183 
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Compound (114) is prepared by hydrogenation of (113) in 
the presence of 10% Pd/C at about 15-25° C. for about 3-6 
hours. Catalyst is removed, washed, and the filtrate is 
concentrated to dryness. The amino group in (114) is pro- 
tected with FMOC to afford (115). 

Where in scheme B is an optionally protected 
monosaccharide such as 2'-deoxyribose, 2-deoxy-2*-R^^- 
substituted ribose, 2'-deoxy-2'-R^^-substituted arabinose, 
ribose or arabinose, the sugar's hydroxyl groups in com- 
pounds (111) and (112) are usually protected with a base- 
labile protecting group such as acetyl, propionyl and phe- 
noxyacetyl. These protecting groups are removed by 
treatment with base during synthesis of (113). 

Exemplary compounds where is a monosaccharide 
have the structures (135)-(139), which correspond to com- 
pounds (ill)-(115) respectively. For compounds (135)- 
(136), at sugar hydroxyls is typically a base labile 
protecting group such as acetyl. R^*' is typically — O — . 
During conversion of (136) to (137), the R^ protecting group 
is removed from the sugar under ring closure reaction 
conditions, which facihtates ring closure. 

Structure (139) compounds may be converted to mono- 
mers suitable for oligonucleotide synthesis. Such monomers 
typically have a coupling group at the 3' position, e.g., 
H-phosphonate, or a phosphoramidite such as a 
p-cyanoethylphosphoramidite, N,N-diisopropyl-amino-P- 
cyanoethoxyphosphine or N,N- 

diisopropylaminomethoxyphosphine. The 5' position will 
contain a DMT-0- or other protecting group suitable for 
oligonucleotide synthesis. The monomers may alternatively 
have a coupling group at the 5' position and a protecting 
group at the 3' position. The protecting and coupling groups 
are added sequentially. 



(135) 
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Scheme C shows synthesis of structure (1) compounds 
where in is — NH— . 



Scheme C 



(101) or 
(111) 




In scheme C, R^^ is — (CHR^^)— (R^^^)— CHR^^— , 
where R^^ is — H or C^-C^ alkyl (typically C^-Cs), usuaHy 
— H, and R^^ is a bond, --CHR^^-^—CHR^^—, 
— CHR^^— S— CHR^^— , —CHR^^^— NR^— 
CHR^2A_ Q_c^^ alkylene (typically Cg-Cg) optionally 
substituted with 1 or 2 moieties selected from the group 
consisting of Ci-Cg alkyl (typically C^-Cg alkyl), — 0R% 
=0, ^N02, -^N3, -CN, — COOR^ or — N(R^).. In R^^, 
any hctcroatom in the spacxr chain wiU be separated from 
the nitrogen atoms that R^^ is linked to by one methylene 
and one or more — CHR^^ — moieties. Typically, adjacent 
carbon atoms in R^^^ are not substituted with — OR^, =0, 
— NO2, — N3, — CN, — COOR^ or — N(R^)2. Protecting 
50 groups present on (119A) are usually stable to oxidizing 
conditions and labile to basic conditions. The protected 
intermediate (11 9A) is prepared by reacting phthalic anhy- 
dride with the appropriate HOCH-^^^— NH2 moiety to 
yield a phthalimide compound or by reacting a phthalimide 
55 compound with HOCH—R^^— Br. 

Compound (117) is prepared by reacting (101) or (111) 
with about 1.0-1.5 equivalents, usually about 1.1 equiva- 
lents of (116) and about 1-2 equivalents, usually about 2 
equivalents of organic base such as DBU or TEA in an 
60 organic solvent such as CH2CI2/CCI4 (about 1:1) or CH2CI2 
by reaction at about 10-30° C, usually at about IS-IS"" C. 
for about 16-48 hours, usually about 20-28 hours. 

Where R^ in scheme C is an optionally protected 
monosaccharide such as 2-deoxyribose, 2'-deoxy-2'-R^- 
65 substituted ribose, 2'-deoxy-2'-R^^ -substituted arabinose, 
ribose or arabinose, the sugar's hydroxyl groups in com- 
pounds (101), (111) and (117) are usually protected with a 



HO 
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base-labile protecting group such as acetyl, propionyl and -continued 

phenoxyacetyl. These protecting groups are removed by ^42) 
treatment with base during synthesis of (118). Compound 

(118) was hydrogenated in ethanol and acid to obtain (119). 
Compound (120) was obtained by reductive alkylation of ^ 

(119) with aldehydes. 

Compound (120) is optionally converted, without remov- 
ing the nitrogen protecting group linked to the exocyclic 
amine, to a monomer suitable for use in oligonucleotide lo 
synthesis, e.g., (121) (not shown), by protecting the 5' 
hydroxy! group by reaction with a protecting group reagent 
such as DMT-Cl (4,4'-dimethoxytrityl chloride) to obtain the 

5'-protected derivative (not shown) . Compound (121) is then y— i 
converted to a derivative (121A) (not shown) suitable for / \ 

coupling the 3' hydroxyl group with another 5' hydroxy! ^ 
group, ix., a coupling group such as H-phosphonate or a ^^"^^^ 
phosphoramidite, e.g., N,N-diisopropylamino-P- 
cyanoethoxyphosphine or N,N- 

diisopropylaminomethoxyphosphine, is Hnked to the 3' posi- 
tion. Exemplary compounds where is a monosaccharide 
have the structures (140)-(143), which correspond to com- 
pounds (117)-<120) respectively. For compounds (140)- 
(141), at sugar hydroxyls is typically a base labile 25 
protecting group such a§ acetyl and R^"^ is typically — . 
In structure (143) compounds, the 5' or 3' oxygen, usually 
the 3" oxygen, is linked to — or a coupling group such as 
H-phosphonate, or a phosphoramidite such as 
P-cyanoethylphosphoramidite, N,N-diisopropy!amino-P- 30 
cyanoethoxyphosphine or N,N- 

diisopropylaminomethoxyphosphine. During conversion of 
compound (140) to (141), the R^ protecting group is 
removed from the sugar under ring closure reaction 
conditions, which facilitates ring closure. 





(140) 

Structure (143) compounds may be converted to mono- 
mers suitable for oligonucleotide synthesis. Such monomers 
40 typically have a coupling group at the 3' position, e.g., 
H-phosphonate, or a phosphoramidite such as a 
p-cyanoethylphosphoramidite, N,N-diisopropylamino-P- 
cyanoethoxyphosphine or N,N- 

diisopropylaminomethoxyphosphine. The 5' position will 
45 contain a DMT, MMT or other protecting group suitable for 
oligonucleotide synthesis. The monomers may alternatively 
have a coupling group at the 5' position and a protecting 
group at the 3' position. The protecting and coupling groups 
are added sequentially. During oligonucleotide synthesis, 

(143) is incorporated into an oligonucleotide such, as one of 

r5o \^ smicture (2), by standard methods and the phthalamide 

protecting group, along with other base labile protecting 
(141) groups at R^^ are removed using basic conditions, e.g., 
NH4OH or NH2CH3, to yield deprotected or partially depro- 
tected — NH— R^2— NH2 as R^ 

Straightforward variations of schemes A-C can be used to 
prepare other structure (1) compounds. For example, scheme 
D depicts a method to prepare structure (1) compounds 
where R^ comprises a cytosine derivative. In scheme D, iPr 
is isopropyl; R^^ is a portion of an R^ moiety having the 
structure — r6-.-R60_ ^i^^^^ r6 ^ ^g^aUy — O— , — -S — , 
d'^^N^ — NH— or —CH^— ; R^^ is 

^..^^^^^ -<CH.).3-^R^%,-(CH^,,-; R^^ is -0~, 

\ 7 -NR^-, -C(0)-, ~CH2-0--CH,-~, -CH^- 

\ / NR^— CH2— or — CH2--S— CH2-; Z3 is 1, 2 or 3, 

/ \ 21 usually 1; Zl is 0 or 1, usually 0; and Z2 is 1, 2 or 3, usually 

^ 1; any functional groups, e.g., —OH, — NH^, —COOK, 

SH, that are optionally present at R^^ 
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are protected. Compound (124) is S'jS'-diacetyl-O'^-siilfonyl- 
2'-deoxyuridme, the synthesis of which is described in the 
examples. 



(125) 




(asi(CH(CH3)a)2 — 




iPi^ 



I / % 



(123) 



iPl 



iPr 



CH2CI2 
DBU 
(124) 




10 



15 




Compound (126) of scheme D is converted to a monomer 
suitable for oligonucleotide synthesis using standard 
methods, e.g., treatment of (126) with a protecting group 

20 such as DMT-a protects the 5' oxygen atom and yields 
compound (127) (not shown). One then links a coupling 
group such as H-phosphonate or a phosphoramidite group 
such as N,N-diisopropylarnino-(3-cyanoethoxyphosphine or 
N,N-diisopropylaminomethoxyphosphine to tiie 3' hydroxyl 

25 group, of (127) to obtain (128) (not shown), which is 
suitable for oligonucleotide synthesis. Variation of the syn- 
thesis shown in scheme D is used to prepare compounds of 
structure (129) or (129A). 



30 



35 



40 



45 
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(129) 




OAc 



(129A) 




Similarly, structure (1) compounds where is 



— — ^r«onh — / \ , 

(CH^i^NH— \;or 



inclading 



45 



-continued 



NR^ 

-r6 ^j^eo^H — II ^NHR^^lciudmg 

"(CHi)i_5NH II ^NHR5 



are synthesized using variations of scheme D and the 
corresponding intermediates, e.g., reaction of (123) with 
2-halopyridine (Bematowicz "JOC* 57:2497 1992). 



6,028483 

including 



46 




10 



(R^)2N (CH2)i_j R« , 



and is — O — or — S — are synthesized using scheme A, 
while scheme B is used when is ~-CH2— and scheme C 
is used when R^ is — ^NH — , Compounds containing these 



structures where R^ 



or — 0 — , in general are 



Compounds of struaure (1) where R^ is 



r62 



..A 



w 



-R* J induding 



N 
H 



R^-R 
Jl \ 




synthesized using Scheme A and compound (103), e.g., 
using #1154-093 described in the examples below, and the 
corresponding protected alcohols. Compounds where R^ is 
15 — CH2 — are obtained by following Scheme B. Compounds 
where R^ is — ^NH — can be obtained using compound (119), 
e.g., using #1090-68 and protected aldehydes described in 
the examples below, by following Scheme C. 

Schemes E and F illustrate the synthesis of alcohols 
containing an imidazole moiety (see, e.g., Munk "J. Med. 
Chem." 40:18 1997). In scheme R^^ is R^ or alkyl having 
1, 2, 3, or 4 carbon atoms and R^^ is — OR^, — <CH2)^^0R^ 
or -(CH2)i.2->R^MCH2)i.2-0R^ and Z4 is 0, 1, 2, 3, 4 
or 5. In scheme F, Z5 is 1, 2, 3, 4 or 5. 



25 



where R^^ is — N— , — CH— or — C(CH3)— , R^^ is'— H, 
— NH2 or — NH(CH3) and R^ is —0— or ~S— • are 
synthesized using scheme A, while scheme B is used when 
R^ is — CH2 — and scheme C is used when R^ is — NH — . 

Compounds of structure (1) where R^ is 



R^^1SIH2 + BrCSN 



Scheme E 
IT 



r62n=C=NH 



r62n=c=NH 



30 



H2NCH(CH2R^^CH(OCH2CH3)2 



N ^ 

R*%H ^ CHjR^ 



•NH ^R^^ 



including 



R" 



•NH(CH2)i_2 ^R^ , 



35 



40 



Scheme F 
R^O(CH2)z4NH2+ BiC^N 



R^O(CH2)z5N=C=NH 



R« 



R^O(CH2)z5N=C=NH «WH(OCH2CH3)2 



45 



R5o(CH2)25NH- 



where R^^ is — H, alkyl having 1, 2, 3 or 4 carbon atoms or 
optionally protected substituted alkyl having 1, 2, 3, 4, 5 or 
6 carbon atoms including — CH3 and — CH2CH3, and R^ is 
— O — or — S — are synthesized using scheme A, while 
scheme B is used when R^ is — CHj— and scheme C is used 
when R^ is — NH— . 

Compounds of structure (1) where R^ is — R^ — R^°— N 
(R3)2, including — RM^H^) — N(R3)2, and R^ is — O— 
or --S — are synthesized using scheme A, while scheme B 
is used when R^ is —CB^— and scheme C is used when R^ 
is— NH—, 



Scheme D is also used to obtain compounds of structure 
(1) where R^ comprises, for example, a moiety such as 
-(?H2),^4r:NH((CH^^^^^^^ 

X ^T^/^^^ V ^r,, ^ when is 

a 



50 ^CH^,.,NH(CH^^.4N((CH2). 4-NH 

R'(CH2)^(R\ and one R^ is H, CH„ CH.CH 



Compounds of structure (1) where R^ is 



(R^)2N 




60 



65 



protecting group or -KCH^X— N(R^^)^ and the other R^ is 
-CH(N[R^^]^N(R^% (48), (49) 

or (50). 

Converting a compound of structure (4) to a compound of 
structure (1) is accomplished by a method comprising dis- 
placing R^^ In this method, R^^ usually is —Br and R^'^ is 
usually — CH — 

R^ groups containing a diol, e.g. 
(CH^OH)^— CH-~(CH2)„-0— where n is 1^, are con- 
verted by cleavage to the aldehyde CHO— €H2-~(CH2)„— 
0— using sodium periodate. Reductive alkylation of the 
aldehyde to a primary or secondary amine is accompHsfaed 
usic^ N(R^)2 where R^ independentiy is — H, C^ ^ alkyl, 
including — CH3, — CH2CH3 or a protecting group, usually 
— II or — CII3, 

To the extent that a compound within the claims' scope 
can not be directly synthesized using the schemes above or 
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the examples below, the artisan will employ straightforward different fluorophor such as fluorescein or rhodamine 

methods known for preparing such compounds, see e.g., B. capable of energy transfer. The two oligonucleotides recog- 

M. Trost & 1. Fleming, cds. "Comprehensive Organic nize a complementary sequence in which the 3' end of the 

Synthesis", volumes 1-8, Pergamon Press; M. Fieser, ed. target sequence binds the oligonucleotide bearing the 

"Fieser and Fieser's Reagents for Organic Synthesis**, vol- 5 3 -terminal fluorophor and the adjacent 5' sequence of the 

umes 1-17, John Wiley & Sons; and A. R Finch Ed. to the oligonucleotide bearing the 5' terminal 

"Tfaeilheimer's Synthetic Methods of Organic Chemistry", fluorophor. Binding is assayed by measuring a change in 

volumes 1-49, latest editions, S. Karger AG. fluorescence of either or both of the oligonucleotides when 

they bind in tandem according to the illustrated model. In 

Uses for the Compoimds of this Invention ^^^^ embodiments only a single labeled oligonucleotide is 

™ J , . employed in the hybridization method. The oligonucleotides 

The compounds of this mvention find uses m the of this invention thus are useful in solution phase hybrid- 

diagnostic, analytic and therapeutic fields or as intemedi- ization diagnostics, i.e., it is not necessary to perform a 

ates m the preparation of compounds useful m such fields. phase separation in order to detect labeled oHgonucleotide 

The R^-substituted compounds of structure (1) are useful binding, 

as intermediates in the preparation of the labeled biopoly- Detecting a target base sequence in a nucleic acid or an 

mers of structure (1), wherein a biopolymer is rendered oligonucleotide using an invention oligonucleotide is 

fluorescent or otherwise detectably labeled by linkage to the accomplished by a method comprising (i) mixing or con- 

polycyclic substructure. It is most convenient, however, to tacting a sample suspected of containing a nucleic acid with 

use the appropriate structure (1) compounds as monomers in an optionaUy labeled invention oligonucleotide comprising 

the preparation of nucleic acids or oligonucleotides. The at least about 7 bases, usually about 12-30 bases, where the 

labeled biopolymers are employed in diagnostic assays or protecting groups have been removed, (ii) allowing time 

preparative procedures in the same fashion as other sufficient for the invention oligonucleotide to bind to the 

fiuorophor-labeled biopolymers, e.g. in fluorescence polar- target base sequence, (iii) separating unbound invention 

ization methods, fluorescence activated cell sorting, oHgonucleotides from bound invention oligonucleotides and 

competitive-type EMIT immunoassays and the like. " (iii) detecting the presence, absence or amount of bound 

The linker- and hydrogen-substituted compounds of invention oligonucleotide. Aspects of the invention include 

structure (1) are useful as intermediates to prepare materials conducting any one of these steps individually, which are 

suitable for use in afEnity purification ofguanine or guanine- each part of the complete method. The use of known 

containing compounds, e.g., nucleosides. The structure (1) 3Q hybridization methods and conditions are applied to accom- 

compounds form hydrogen bonds with guanine and one can plish the method. 

thus link the structure (1) base structure to an appropriate In this method to detect target base sequences, one may 

support material or polymer to prepare an affinity resin optionally use invention oligonucleotide and target base 

smtable for binding to guanine-containing compounds, e.g., sequences that are substantiaHy complementary to each 

nucleosides, nucleotides and oHgonucleotides or similar 35 other, i.e., sequences having 1 or no mismatches per about 

compounds containing guanine analogs, e.g., every 15-30 bases. The target base sequences to be detected 

7-deazaguanine. Typical linker-derivatized structure (1) are generally present in a cell, in a cell or tissue extract or, 

^^S^l^"^^ ^optionally contain a linker of structure usually, in a purified nucleic acid or oligonucleotide 

R , defined above in the discussion under preparation, e.g., a sequence encoding a portion of a 

Scheme A. cytokine, cell surface molecule, an enzyme such as famesyl 

The monomers are of particular use in preparing oligo- protein transferase or an oncogene such as neu, myc, raf, ras 

nucleotides for diagnostic or therapeutic use. Since oligo- or c-ras. The target base sequences to be detected are 

nucleotides having 2 or more nucleotides or nucleotide generally present in RNA or single stranded DNA, although 

analogs bearing the polycyclic substructure will usually the invention oligonucleotides are also useful to detect base 

exhibit greatly increased Tm, such oligonucleotides are 45 sequences in duplex nucleic acids, 

particularly useful in therapeutic or diagnostic utilities Another invention embodiment is a method comprising 

where highly stable duplex hybridization structures are incubating a cell with a deprotected invention oligonucle- 

desired. Since these oligonucleotides frequently are otide containing at least about 7 bases, usually about 12-30 

fluorescent, changes in the oligonucleotide fluorescence can bases, wherein the invention ol^onucleotide is optionally 

be followed as the oligonucleotide binds to complementary 50 present in a transfection complex comprising the invention 

nucleic acid or oligonucleotide sequences. These changes oligonucleotide and a permeation enhancing agent. This 

are detectable as modifications in energy transfer, e.g., method is used to introduce optionally labeled invention 

fluorescence quenching or shifts in activation or emission oligonucleotides into cell cytoplasm or vacuoles. 

wavelength(s). One optionally conducts this method using invention 

The polycyclic substructure labeled oligonucleotides are 55 ol^onucleotides having a octanolwater partition coefiBcient 

employed in diagnostic or analytic methods in the same of about -0.5 to about 2.5, typically about 0.0 to about 2.0, 

fashion as other labeled oligonucleotides. For example, the usually about 0.5-1.5, and a solubility in water of at least 

oligonucleotides are used in hybridization methods in which 0,001 /ug/mL, However, in these embodiments, no perme- 

an antft>ody capable of binding base-paired structure (1) is ation enhancing agent is usually needed to introduce the 

used to detect binding of the oligonucleotide to a target 60 compound into the cells. 

nucleic add sequence. In addition, changes in fluorescent One can use oligonucleotides containing 1, 2, 3 or more 

character can be assayed as described above. Typically, 2, 3 invention base(s) to detect a base pair mismatch in a nucleic 

or more polycyclic substructure labeled oHgonucleotides are acid sample using ribonuciease protection assay methods 

used in a hybridization assay. One oligonucleotide is labeled described in U.S. Pat. No. 5,589329. Thus, one can use the 

at its 3' end with a polycycHc substructure containing 65 invention compounds as described in step (a) or step (b) or 

nucleotide while the other nucleotide is labeled at its 5' end step (c) (or, in sequence, steps a, b, or steps b, c or ste^ a 

with the same or another polycyclic substructure or with a b, c) of claun 1 of U.S. Pat. No. 5,589,329 to practice that 
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claimed method (or to practice necessary steps in the claim labeled oligonucleotides containing 1, 2, 3 or more invention 

1 method, e.g., contacting an RNA probe containing an bases, see e.g., U.S. Pat. Nos. 4,910300, 5,093 232 5 124 

invention base with a single stranded nucleic acid to form a 24^, 5,202,231, 5,258,506, 5,202,231, 5,525,464, 5,578,71?' 

duplex). One can similarly use oHgonucleotides containing 5,578,715, 5,578,467, 5,591,584, 5,599,932, 5,599 668* 

an invention base(s) to screen mammaHan genomic DNA 5 5,593,841, 5,578,458, 5,589,332, 5,589,333, 5,589339 

samples tor insertions, deletions or substitutions using 5 589 342 5SQ^R^n S^O^s^i ^^o-^s^o ^^q-x q-xa^ 

330. Thus, one can use the invention compounds as c <Qr jc.^ c /cn^ 704 ^^;n< 7oa <^;n< toc oo^ 
described in step (ii) or step (iii) or step (iv) or step (v) (or, If^J^^' ^'^g'Jg' f ^g'^ 5' ^.g5,798, 5 605 824, 
in sequence, steps i,u, or steps i,ii,m or steps m,iv, or steps ^'„q^, c^^'^?, Izitini' All'lll' AlliXl' 
i, ii, iii, iv, etc.) of claim 1 of 5,589^30 to practice that " IHHI} Ik^^^^i/' t¥^^'5°^', ^^^u?'^*'' ^r^^'^^A 
Claimed method (or to practice necessary steps in the claim ' 5.641,631, International Pubbcation Nos. WO 

1 method, e.g.. contacting an oHgonucleotide containing an ^/^^' ^0 97/06252, WO 97/06183, WO 97/04787, 
invention base with an immobiHzed genomic DNA saraple 97/05280, WO 96/41017, WO 96/41012, WO 

to form a triplex or duplex). One can similarly use oligo- 9^40994, WO 96/40996, WO 96/40991, WO 96/40992, 
nucleotides containing an invention base(s) to design a 15 WO 96/41016, WO 97/04126, WO 97/04129, WO 96/06950 
synthesis method for an array of materials to be synthesized European Publication No. 761 822. For each of these 

on a substrate using methods described in U.S. Pat. No. applications, one would use an invention ohgonucleotide in 
5,593,839. Thus, one can use the invention compounds as pla«= of one or more of the described oligonucleotides. To 
described in the first part of the second step or the second practice these and other typical applications, one will typi- 
part of the second step or the third part of the second step or 20 cally use an invention oligonucleotide in one or more of the 
the fourth part of the second step or (or, in sequence, in the steps needed to practice the methods described in these 
first step and in the first part of the second step, etc.) of ckim pubHcations. In many of these s^Kcations, one wiU use an 
1 of 5,593,839 to practice that claimed method (or to invention oligonucleotide containing (i) about 7-50, usually 
pracuce nectary steps in the claim 1 method). about 8-30 linked monomers, usually where the oligonucle- 

One can ako ^e the invention monomer con^oations 25 otide has a uniform polarity, (ii) 1, 2 or 3 invention bases, 
con ammg a 5 a- S-ttuotnphosphate group or a 5' triphos- (iii) purine and pyrimidine bases having a base sequence 
phate group linked to 2',3 -dideoxyribose to perform dideoxy complementary or substantially complementary to a target 
DNA sequencing methods. One may use invention mono- sequence, i.e., a defined base sequence having no more than 
meis m kite tot optionaUy contain buffers or enzymes about 1, 2, 3 or, for relatively long oligonucleotides (about 
smtable for DNAsequencing. The mvention monomers may 30 35-50-mers), 4 base mismatches, and, optionally, (iv) other 
be advantageously used m enzymatic DNA sequencing moieties or features, which are readily apparent to the skilled 
protocols because the mvention monomers, which act as artisan who has read this disclosure, that facilitate using the 
cytosmc surrogates, have a high affinity for guanosinc and ohgonucleotide in the intended application, e.g., (i) a free 
may perform better than cytidine 5' triphosphate in sequenc- hydroxyl group at the 3' terminus for applications where the 
mg reactions, particularly where the DNA to be sequenced 35 one wishes to use the invention oligonucleotide as a primer 
contains a high proportion of guanosine residues, which can in enzyme-mediated chain elongation appUcations, (ii) a 
cause sequencing prcblems. fluorescent label, enzyme label or radiolabel to facilitate 

Ohgonucleotide analogs containing 1, 2, 3 or more inven- oligonucleotide detection in a particular assay, (iii) biotin 
tion bases are also suitable for binding to open complexes in Unked to a convenient location such as the 5' or 3' terminus, 
cells or cell lysates of eukaryotic or prokaryotic cells. Open 40 or (iv) a free 5' hydroxyl group for enzymatic phoq)hory- 
complexes may also arise in systems comprising at least lation. 

partially purified cell components, e.g., RNA polymerase, When one adapts the presently claimed compounds to 
?>xTA°u transcription cofaciors, nses known in the art, one will use known hybridization 

DNA bmdmg protems and duplex DNA capable of tran- conditions, enzyme (such as polymerase, RNase or DNase) 

°^ « conditions or detection technologies to design the 

DNA or RNA that occur at least ti;ansiently during duplex invention oligomicleotide in a way that does not interfere 
nucleic acid metabohsm, e.g., durmg DNA replication or with the intended use, or in a way that improves the imended 
RNA tra^phon m flie nucleus, cytoplasm, plastids or use. For example, when a previously described assay or 
mitochondria. Su<^ DNA rephcation or RNA transcription diagnostic method calls for conductiiig an oUgonucleotide 
may involve metabohsm of cellular, viral or other nucleic so hybridization or enzyme amplification protocol at a specified 
acids. One can use bmdmg of invention oligonucleotides to temperahire, one will have the option of increasing the 
smgle stranded open complex sequences to affect nucleic hybridization or enzyme amplification temperature due to 
acid metabohsm e.g one may inhibit RNA transcription or the presence of an invention base(s) in the oKgonucleotide. 
one may use the ohgonucleotides, which are optionally Thus, one can use the enhanced binding affinity or enhanced 
labeled, to detect the presence of open complexes. Such ss binding specificity property of the invention oligonucle- 
ohgonucleotides will typicaUy oompnse about 8 to 30 otides to increase hybridization stringency. Similarly, when 
monomers, usually about 12-21 monomers, having a one intends to use an invention oligonucleotide in a poly- 
sequeiice complementary to a smgle stranded open complex merase chain reaction (PGR) amplification method, one 
region(s) mvolved m the mitiation of a nucleic acid meta- would initially test to see if the presence of an invention base 
bohc event e.g., initiation of RNA transcription in a pro- 60 at the 3' terminus improves the oUgonucleotide 's primer 
moter or transcnption imtiation region. Workers have fimction and then adjust the reaction conditions accordingly, 
described open complexes and their formation in vitro and e.g., by altering the primer to target sequence ratio, primer 
."^P^f N fr TT?"'^'':T>: «=Ic'^t^ nof t' -«tio» or by 'altering the Lpe^ahire at whichTne 

B!rch^S;?68 655'J;^^Qf ^^^^^^ denaturesandamplifiesthePCRreartion.Ifthepresenceof 
iiioi ctiem 268.6535-40 1993. ^5 an invention base s^nificantly affects polymerase-mediated 

WorkMS have described other appHcations that one can primer elongation, then one could choose to design inven- 
practice ma similar straightforward manner using optionally tion oligonucleotides for tins use without an invention 
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base(s) at the 3' tennmal 1, 2 or 3 moiwmer positions. invention polycycUc bases, usually 2, 3 or 4, which can be 

Skilled artisans routinely design diagnostic or assay proto- the same or different polycycles but generally are the same 

cols by tcsUng varying temperature, salt composition and invention polycycle. They also optionally are copolymerized 

concentration, pH, oligonucleotide concentration, enzyme with purine or pyrimidine bases containing known alkynyl 

concentration, enzyme reaction buffer, net oligonucleotide 5 substitutions (e.g., U.S. Pat. Nos. 5,645,985 and 5,594,121), 

charge, or oligonucleotide base, sugar or linkage structure ^ particular pyrimidine bases substituted at the 5 position 

during assay development. with a carbon atom which is bonded to another atom by a Pi 

Embodiments include a method comprising preparing a *® fluorescent cytosine derivatives of Inoue et al. 
series of oligonucleotides, each having the same base 

sequence, which sequence contains 2 or more cytosine 10 7^^ compounds of this invention, or other oligonucle- 

bases, where each member of the series contains an inven- capable of forming high melting duplexes (e.g. the Pi 

tion base of structure (3) in place of one or more of the bonded bases discussed above), are useful in improved 

cytosine residues. Such oligonucleotides typically comprise methods fiar polymerase chain reaction ("PCR") or Hgase 

2 to about 6 cytosine residues. One prepares oligonucle- ("LCR") ainplification and detection of 

otides containing an invention base at each cytosine position 15 nucleic acids. In one embodiment, the high melting oligo- 

and optionally one prepares oligonucleotides containing an nucleotides are used as one or both primers in classical PCR 

invention base at each of two cj'tosine positions, at each of probes in LCR. Particularly in PCR processes, the 

three cytosine positions and so forth. One uses this method elevated melting temperature of duplexes with high melting 

to determine which oligonucleotide(s), compared to a con- Primers avoids the need to theamally cycle the reaction 

trol containing no invention base(s), has optimal properties 20 because at these elevated temperatures (about 68 to 95° C, 

for a desired application, e.g., hybridization afiSnity for use preferably greater than about 75° C; the derivative primer 

as a probe or optimal antisense activity for inhihiti!^ target ^ continue in at least some prqjortion to anneal to the 

gene expression in a cell. '^'S^t but extension product will not. Ordinary primers will 

Stmctuie (5) monomer, when triphosphorylated and con- n^l'lSf P^iyf^^^ase wiU not initiate transcrip- 

taining R^ ribose or deoxyribose derivatives that are chain ^5 ^'^^ the reaction mixture is cooled to a level at which 

terminating (e.g. where the 3' position is not hydroxyl), are Jfo^r'T^'^^ *° target sequence (usuaUy, about 

useful in methods for fluorescent chain-terminating C.)^TTie elevated temperature that is chosen for use with 

dideoxymicleotide sequencing in the same general fashion Jl^f'rJJT^ "''T °^g°°"'rl^tides (a temperature 

as ddNTPs having other linker-attached fluorophores. suitable for all of annealmg, extension and meltmg) is one 

<:4„^» ,-A J , r 30 *t "mnch a substantial proportion of the extended primer 

Since ohgonucleotide compounds such as those of struc- ^ population (about 10 to 90 mole %) is 6,^ ^50^ 

t^SZ^'^^^tiMl'^^S^^^^^^ fromtheta4et,butsuffidentunextei.LpSrrbSt^ 

painng they will b nd to nucleic acds and herefore are pennit extension. Optimally, this is about from 85 to 95° C 

s~re m S^Ll r'"""? u ^'"^ ""^''^y ^ »° ^^''^C. Alternatively, the op^^ tempeS-' 

structure (1) m such ohgonucleotides will recogmze gua- ture is determined empirically by simply selectine a ranse of 

nosme as Its complementary base in natural nucleic acids. te„>peratures within ^the mel^ngTnge"^ 

o 1 w'^^? oligonucleotides, including many structure (2), sequence, but within the annealing range of the derivative 

(2B) and (20) ohgonucleotides capable of fonning primers, and measuring the amount of amplification product 

high melting duplexes with complementary sequences, are to achieve satisfactory levels for the diagnostic or prepara- 

usetul m numerous applications, including antisense or ^ tive processes at hand. Amplification methods have been 

codeWocking utdities m vivo or m vitro as well as diagnos- described, e.g., U S Pat No 5 667 974 

w Z^^ T" "^S'? '^^f ''^^ r '"'^"S ^ exemplary method to use In inVention ohgonucleotide 

melting temperatures substantially above the melting tem- to amplify a nucleic acid base sequence compLs provS! 

ST "'^f °r °^ ^^^'^ ^^^'^^ of the ing an' invention oHgonucleotid^ and t^TSek a^kl 

same sequen^ that contain the ordi^, naturally occurring sequence that forms a complex having a Tm of about 85 o 

£^nf t'^%""t'' C, optionally heating the complex'to about 85 "9'° C 

dme and the like. Substantially above" means that the (e.g., to a temperature within about 5° C. of the Tm) to 

complementary sequence, will not dissociate from the complex with a DNA^ olymerase such as Tag polymera«i or 

ordinanly about 8 to 40 C, above the dissociation tem- complex and the heated complex is typically a duplex Tlie 

rrri , Q'."^ I?.- ' V^^'''^' temperature ditions suitable for amplification puiposes. 

than about 95° C. This is known as the D Tm. Ordinarily, D i, ,„5ii k» ^ ,u . .u , 

Tm is measured by comparing control oligomickotide btod^ ^ understood that the opUmal temperature will 

ingtocomplemenLryRNAo?DNAS*rbSgof^t " ZenTherher Lv^^^^^^^^^^^ '^T.^ 

oKgonucleotide to the same RNA or DNA, foUowing e g ! k % T ^ u °' "^P*'^^^ 

the method described in Jones et aL, SsSsTfim' • T'^'^' °^ ^"^^ ^ P'^™*'^ 

<^n,-.„ffi, • r -u ■'y^ annealmg temperatures are found with primers having 

Some of the invention nboside and deoxyriboside com- greater than about 18 bases or base analogs), the propoitioi^ 

pounds are fluorescent. The compounds remain fluorescent so of pyrimidines and purines and the lil^^The teTSk 

fnCcXT ? T ^« ^"ble polymerase useful in this system is for examplel^q p^j! 

fftfr H-?^^ ^' 'f^^'^'^J^'' bound to target sequences merase or other suitable hea( stable enzyme. Thus, whaLver 

wim Known memods. priming reactions are conducted conventionally but at a 

One can optionally prepare oUgonucleolides having tan- 65 substantially constant temperature 

dem arrangements of the novel bases. In general, such Not only do the ohgonucleotides of this invention fadli- 

tandem arrangements will contain from 2 to about 10 tate PCR or LCR processes, the fluorescent properties of the 



6,028,183 



53 



primers also facilitate detection of the extension products. 
The extension products are readily separated from the unex- 
tcndcd primers, e.g. on the basis of molecular weight, and 
detected by their fluorescence, thereby avoiding staining 
with agents such as ethidium bromide. In some 
embodiments, the fluorescence is enhanced by using MP's 
comprising the fluorescent substructures of this invention in 
primer extension so that the fluorescent NTPs are incorpo- 
rated into the extension products as well. The polycyclic 
substructure used in the NTP's may be the same or different 
than the one incorporated into the primers. 

We incorporate herein all citations by reference with 
specificity. 

The following examples further illustrate and do not limit 
the invention. 

EXAMPLE 1 

The following example shows synthesis of representative 
starting materials and intermediates for making invention 20 
compounds. 



25 



30 




.OTf ■ 



10 



15 



54 

-continued 




ccH^«rrFA — ^ 



1019-97 



1019-102 



1090-05A 




.CH2CH2CH2NHCB2 



1090-05 



1019-46 



35 Conditions: a: U^, 10% Pd/C, CH3OH, RT; b: HC(0CH3)3, 
methanesulfonic acid, 47.8%; c: TfjNPh, CHXl^/DMF, 
K2CO3, RT, quantitative; d: HfeCCH^NHTFA, PdCPhsP) 
4, Cul, TEA, DMF, 60%; e: H2, 10% Pd/C, CHgCOOEt; f: 

40 con. NH40H/Dioxane (1/1), RT; g: PhCH20C(0)Cl, pyri- 
dine; h: 3N HCl/EtOH (1/1), RT, 1 hr or 40° C, 30 min. 
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(CH2)3NHCBZ 




H2 (in a balloon), 
10% Pd/C, 
O CH3OH, 68% 



1090-22 



(CH2)3NH2 




1. Pmoc-NHS, CH^^ 

2. DMT — Ci, pyridine, 
49% (2 steps) 



I'A, pyndiae, 
O CH2CI2 



1090-26 
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9-(3'-Aminopropyl)phenoxazine 

2-Aminoresorcinol (#1019-43): A ethanoi solution (500 
mL) of 2-mtroresorinol (20 g; 0129 mmole) was hydroge- 
nated in balloon) in the presence of 10% Pd/C (1 g) at 
room temperature overnight. The catalyst was filtered off 
through a celite pad. ITie filtrate was concentrated and 
purified by flash column chromatography, affording 15 .5g, 
96% of 2-aminoresorcinol. Hi NMR (DMSO-dg): 5 8.80 (bs, 
2H), 6.15-6.30 (m, 3H), 3.85 (bs, 2H). 

Compound #1019-45: A trimethyl orthoformate solution 
(100 mL) of 2-aminoresorcinol (6.0 g; 48 mmole) was 
treated with methanesulfonic acid (4.6 g; 48 mmole). After 
30 min stirring at room temperature, the reaction was cooled 
to 0° C. and was quenched with TEA (4,8 g; 48 mmole). The 
reaction mixture was concentrated and purified by column 
chromatography on silica gel, yielding 3.10 g of product, 
47.8%. NMR (DMSO-d^): 6 10.3 (bs, IH), 8.53 (s, IH), 

7.18 (dd, IH, J=8.0 Hz, J=7.8 Hz), 7.12 (d, IH, J«8.0 Hz), 35 
6.73 (d, IH, J«7.8 Hz). 

Compound #1019-46: A CH^Cl^/DMF (N,N- 
dimetfaylformamide) solution (20 mL/10 mL) of #1019-45 
(2.90 g; 21,4 mmole) was stirred with solid I4CO3 (14,8 g; 
107 mmole) at room temperature for 30 min, followed by 
addition of N-phenyltrifluoromethanesulfonimide (8.50 g; 
23.6 mmole). The resulting mixture was stirred at room 
temperature overnight, then diluted with CH^Cl^, washed 
with water twice, dried, concentrated, and purified, affording 
5.76 g, quantitatively, of # 1019-46. ^H NMR (CDCI3): 8 

8.19 (s, IH), 7.66 (d, IH, J=8.2 Hz), 7.48 (i, IH, J-8.2 Hz), 
7.33 (d, IH, J=8.2 Hz). 

N-TFA-propargylamine (#1019-44): Propargylamine (25 
g; 0.45 mmole) was dissolved in CH3OH (500 mL), fol- 
lowed by addition of ethyl trifluoro acetate (84 g; 0.59 mole). 
The resulting mixture was stirred at room temperature 
overnight, then concentrated to dryness. The residue was 
redissolved in CH2CI2 (200 mL), washed with saturated 
NaHCOj aqueous solution. The organic phase was isolated, 
dried, concentrated to a brown residue (liquid). The product 
#1019-44, 54.0 g was distilled off, 78.8%. NMR 
(CDQa): 6 6.60 (bs, IH), 4,18 (m, 2H), 2.38 (s, IH). 

Compound #1019-97: A DMF solution (12 mL) of com- 
pound #1019-46 (4.9 g; 18.3 mmole), N-TFA- 
propargylamine (5.5 g; 36.7 mmole), Pd(PPh3)4 (4.3 g; 3.7 
mmole), Cul (1.78 g; 9.3 mmole) and TEA (3.7 g; 36.7 60 
mmole) was stirred at room temperature for 24 hours. The 
reaction mixture was diluted with CH2CI2 (100 mL), stirred 
with Dowex 1x800 (HCO3— form). The organic phase was 
washed with water, dried, and purified, yielding 2.94 g, 60%, 
of product #1019-97. ^H NMR (CDCI3): 6 8.18 (s, IH), 7.62 65 
(d, IH, J-9 Hz), 7.48 (d, IH, J=9 Hz), 7.36 (t, IH, J-9Hz), 
6.85 (bs, IH), 4.51 (d, 2H, J=3 Hz). 
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Compound #1019-102: Aethylacetate solution (60 mL) of 
#1019-97 (1.7 g; 6.3 mmole) was hydrogenated in the 
presence of 10% Pd/C (200 mg) at room temperature. The 
catalyst was filtered off. The filtrate was concentrated to 
dryness, used for next reaction without further purification. 
^H NMR (CDCI3): 6 8,25 (bs, IH), 8.12 (s, IH), 7.49 (d, IH, 
J=8.2 Hz), 7.36 (dd, IH, J=7.6 & 8.1 Hz), 7.20 (d, IH, J=7.5 
IIz), 3.25 (q, 211), 3.07 (t, 2n, J-6.6 IIz), 2.01 (p, 211). 

Compound #1090-05A: A 1,4-dioxane solution (10 mL) 
of compound #1019-102 (1.0 g; 3.6 mmole) was treated with . 
concentrated NHOH (15 mL) at room temperature for 16 
hrs. The reaction mixture was concentrated to dryness. The 
residue was redissolved in CU2CI2 (30 mL), containing TEA 
(0.74 g; 735 mmole), cooled to 0** C, followed by addition 
of benzyl chlorofonnate (0.75 g, 4.4 mmole). Hie resulting 
solution was stirred at room temperature for 4 hrs, washed 
with H2O, dried and purified by flash column diromatog- 
raphy to give 0.81 g, 71% of #1090-05 A. 

Compound #1090-05: A ethanoi solution (10 mL) of 
compound #1090-05 A (0.81 g, 2.6 mmole) was treated with 
3 NHCl aqueous solution (10 mL) at 40'' C, for 1 hr. The 
reaction mixture was concentrated to dryness, azeotroped 
with CH3CN three times. The crude product was used for 
next reaction, without further purification. "^H NMR 
(CDa3): 8 7.25-7.40 (m, 5H), 7.20 (t, IH), 6.78-6.90 (m, 
2H), 5.07 (s, 2H), 3.17 (t, 2H, J=6.8 Hz), 2.67 (t, 2H, J«7.6 
Hz), 1.79 (p, 2H). 

Compoxmd #1090-18: Compound #1090-15 has been 
described (Lin, "JACS" 117:3873-3874, 1995). A mkture 
of Compound #1090-15 (1.5 g; 2,6 mmole), #1090-05 (2.6 
mmole) and DBU (0.8 g; 5.2 mmole) in CH^Ciz (30 mL) 
was stirred at room temperature overnight The inaction 
mixture was washed with 10% citric add aqueous solution, 
dried and purified on silica gel column chromatography, 
affording 1.21 g, 69% of product. ^H NMR (CDCI3): 8 8.86 
(s, IH), 7.99 (s, IH), 7.29-7.40 (m, 5H), 7.15 (t, IH), 6.98 
(d, IH, J=8 Hz), 7.78 (d, IH, J=7 Hz), 6.27 (t, IH), 
5.20-5.23 (m, IH), 5.09 (s, 2H), 4.78 (bs, IH), 4.30-4.52 
(m, 3H), 3.20-3.30 (q, 2H), 2,65-2.80 (t+m, 3H), 2.10-2.20 
(2 s+m, 7H), 1.80 (p, 2H). 

Compound #1090-22: Compound #1019-18 (1.20 g; 1.78 
mmole) was treated with saturated NH3 in CH3OH (200 mL) 
at room temperature for 5 days. The reaction mixture was 
concentrated to dryness and purified by flash column chro- 
matography affording 0.63 g of product #1090-22, 70%. ^H 
NMR (CDCl3+10% CD3OD): 8 7.30-7.38 (m, 5H), 6.81 (t, 
IH, J«7.3 Hz), 6.73 (d, IH, J=7.9 Hz), 6.60 (d, IH), 6.20 (t, 
IH), 5.10 (s, 2H), 4.36-4.40 (m, IH), 3.9^.0 (m, IH), 
3.70-3.90 (m, 2H), 3.23 (t, 2H), 2.58 (t, 2H), 2,30-2.40 (m, 
IH), 2.10-2.24 (m, IH), 1.70-1.82 (m, 2H). 
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Compound #1090-25: Compound #1090-22 (0.6 g) was 
dissolved in ethanol (10 mL) and was hydrogenated Ql^, ^ 
a balloon) in the presence of 10% Pd/C (50 mg) at room 
temperature for 4 hrs. The catalyst was filtered off, washed 
with CH3OH. The filtrate was concentrated and dried to 
afford 0.3 g, 68% of product, 'H NMR (CD3OD): 6 738 (s, 
IH), 6.7-6.82 (m, 2H), 6.57 (d, IH, J=6.3 Hz), 6.23 (dd, IH, 
J=6,5 Hz, J-6.7 Hz), 4,35-4.38 (m, IH), 3,80-3.90 (m, IH), 
3.70-3.80 (m, IH), 2.60-180 (m, 4H), 2.10-230 (m, 2H), 
1.75-1.86 (m, 2H). 

Compound #1090-26: Compound #1090-25 (0.13 g, 0.34 
mmole) was dissolved in DMF/CHjCl^ (1 mL/3 mL), fol- 
lowed by addition of 9-fluorenylmethyl-N- 
succinimidylcarbonate (FMOC-NHS, 0.14 g; 0.41 mmole). 
After 1 hr of stirring at room temperature, to the reaction 
mixture p}Tidine (140 mg, 1,7 mmole), and DMT-Cl (4,4'- 
dimethoxytrityl chloride; 0.14 g; 0.41 mmole) were added. 
The resulting mixture was stirred at room temperature for 2 
hrs and then diluted with CH2CI2, washed with water twice. 
The organic phase was isolated, dried and purified by flash 20 
column chromatography on silica gel, to give 153 mg, 49%, 
FAB HRMS (high resolution mass spectroscopy) calculated 
for M+H-" 899.416, found 899366. 

Compound #1090-31: Compound #1090-26 (150 mg; 
0.167 mmole) in CH2CI2 (2 mL) was added to a 0^ C. cold 
CH2CI2 solution (1 mL) of PA (0.25 mL of IM CHgCl^ 
solution, 0.25 mmole) and pyridine (66 mg, 0,83 mmole). 
The resulting mixture was then gradually warmed to room 
temperature. After 30 minutes, the reaction mixture was 
diluted with CH^Cl^, washed with 1 M TEAB aqueous 
solution, dried and purified on silica gel, eiuted with 5% 
CH3OH/CH2CI2, then 15%, H2O in CH3CN. The combined 
fractions of product were concentrated and then partitioned 
between CHjCI^ and IM TEAB aqueous solution dried, 
yielding 180 mg of H-phosphonate derivative, quantita- 
tively. 
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rt, 5 days 
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CH3OH, 
15 molai % HCl 
(in dioxane) 
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Compound #1090-68: A CH^Q^— CCl^ (40 mL/40 mL) 
solution of 5-bromo-3'-5'-diacetyl-2'-deoxyuridine (2.0 g, 
5.1 mmole) and triphenyl phosphine (2.0 g, 7.6 mmole) was 
heated at reflux for 3 hrs. The reaction mixture was cooled 
to room temperature, followed by addition of 2-amino-3- 
nitrophenol (0.79 g, 5,1 mmole) and DBU (1.6 g, 10 
mmole). After stirring at room temperature oveniigfat, the 
reaction mixture was washed with 10% citric add aqueous 
solution, dried, concentrated and purified on silica gel. The 
isolated product contained PhgP^O, was treated with satu- 
rated NH3 in CH^OH for 5 days at room temperature. After 
removal of all solvent, the residue (crude #1090-51) was 
redissolved in CH3OH (100 mL), and hydrogenated with 
in a balloon in the presence of 10% Pd/C and 4 N HCl in 
dioxane (200 mL, 0.8 mmole). After 5 hrs, the catalyst was 
filtered off, washed with CH3OH. The filtrate was concen- 
trated and purified by flash column chromatography to 
afford 0.83 g of #1090-68, in 50% yield for 3 steps. NMR 
(DMSO-Cl^): 6 9.6 (b, IH), 6,56 (t, IH, J-8.0 Hz), 6.22 (d. 



IH, J=7.8 Hz), 6.10 (t, IH, J=6.8 Hz), 5.96 (d, IH, J-7.1 
Hz), 5.17 (d, IH), 4.92-5.08 (m, 3H), 4,16-4.20 (m, IH), 
3.70-3.80 (m, IH), 350-3.60 (m, 2H), 2.0 (m, 2H). 

Phthalimidoacetaldehyde #1090-70A: A mixture of N-(2- 
hydroxyethyl)phthalimide (90 mg; 0.47 mmole), DCC (0,15 
g, 0.7 mmole), DMSO (1 mL) and dichloroacetic acid (20 
mL) was stirred at room temperature for 2 hrs. The reaction 
mixture was diluted with CH2CI2, washed with H2O twice, 
dried and concentrated. The crude product #1090-70A con- 
tained DCC-urea by-product, was used for the next reaction 
without further purification. 

Compound #1090-70: A DMF/CH3OH (0.5 mU2 mL) 
solution of #1090-68 (0.2 g, 0.42 mmole), phthalimidoac- 
15 etaldehyde #1090-70A(0.47 mmole) in CH3OH (2 mL) and 
acetic acid (85 mg, 1.4 mmole) was reacted with sodium 
cyanoborohydride (87 mg, 1.4 mmole) at room temperature 
for 4 hrs. The reaction mixture was concentrated, and 
purified on silica gel, yielding 243 mg, 78%, yield of 
20 compound #1154-70. 'H NMR (CDC1,+10% DMSO-d^): 6 
9.45 (s, IH), 7.60-7.68 (m, 2H), 7.50-7.58 (m, 2H), 6.51 (t, 
IH, J=8.2 Hz), 6.12 (d, IH, J=8.2 Hz), 6.06 (t, IH, J=6.7 
IIz), 5.78 (d, in, J-8.0nz), 4.94 (t. III), 4.49-4.52 (m, lit), 
4.18-4.25 (m, 2H), 3.5Q-3.72 (m, 4H), 3.19-3.30 (m, 2H), 
1.85-2.05 (m, 2H). 

Compound #1090-89: Compound #1090-70 (35 mg; 69 
mmole) was dissolved in CH2CI2 (2 mL) and pyridine (0.5 
mL), then reacted with DMT-Cl (28 mg, 83 mmole) at room 
temperature for 3 hrs. The reaction mixture was worked up 
and purified by flash column chromatography, yielding 44 
mg, 78.6% of corresponding 5'-0-DMT-derivatives. ^H 
NMR (CDCI3): 8 7.80-7.90 (m, 2H), 7.68-7.75 (m, 2H), 
7.18-7.50 (m, lOH), 7.0 (b, IH), 6.80-6.90 (m, 4H), 6.77 (t, 
IH), 6.30-6.42 (m, 2H), 5.94 (d, IH, J=8.0 Hz), 5.50 (bs, 
IH), 4.50-4.60 (m, IH), 4.10-4.18 (m, IH), 3.86^.0 (m, 
2H), 3.25-3.30 (2s, 6H), 3.28-352 (m, 4H), 2.58-2.68 (m, 
IH), 2.20-2.32 (m, IH), 

Compound #1090-91: Compound #1090-89 (44 mg) was 
converted into 3'-H-phosphonate in normal fashion, 52 mg, 
in 77% yield. 
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OH 
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a:R= CH2CH2NHCBZ, 

b:R=: CH2CH2CH2NHCBZ, 

c:R= ai2CH2N(CH3)2 



2. FMOC NHS, 

DMF,RT 

3. DMT CI 

Pyridine, RT 




DMTO 



a:R'= CH2CH2NHFMOC, 

b:R'= CH2CH2CH2NHFMOC, 

c: R' = CH2CH2N(CH3)2 

• Steps 1 & 2 are only applied to 
-CH2CH2NHCBZ 



R = 



and 



-CH?CH7CH7NHCBZ 



50 

N-CBZ-3-aminopropanoi (Compound #1154-74, CBZ; resulting solution was gradually warmed to room tempera- 
benzyloxycarbonyl): A CH2CI2 solution (300 mL) of ture and stirring was continued at room temperature over- 
3-aminopropanol (10 g, 0.133 mole) and TEA (20 g, 0.2 night. The reaction was worked up and purified by flash 
mole) was cooled to 0"" C, followed by slow addition of column chromatography on silica gel, affording 22.8 g in 
CH2CI2 solution (25 mL) of benzyl chloroformate (25 g, 55 66% yield of title compound. % NMR (CDCl:^): 6 
0.147 mole). The resulting solution was gradually warmed 7.34-7.40 (m, 5H), 5.20-5.32 (m, IH), 5.14 (s, 2H), 3.74 (q, 
to room temperature. The stirring was coatinued at room 2H), 3.37 (q, 2H), 2.38 (bs, IH). 

temperature overnight. The reaction mixture was washed 3'-5' -Diacetyl-N'^-(2",6"-dihydroxypfaenyl)-2'-deoxy-5- 

with H2O, dried and purified by flash column chromatog- bromo-cytidine (Compound #1154-093): A Cd^-^M^Cl^ 
raphy (on silica gel, CH3OH— CH2CI2) to yield 16.4 g, solution (150 mL— 150 mL) of 5-bromo-3'-5'-diacetyI-2'- 
58.9% of title compound. NMR (CDCI3): 6 7.26-7.40 ^° deoxyuridine (15 g, 383 mmole) and PhgP (15 g, 57.5 
(m, 5H), 5.10 & 5.0-5.10 (s+m, 3H), 3.68 (q, 2H), 336 (q, mmole) was heated at reflux under N2 for 3 hrs. The reaction 
2H), 2.58 (t, IH), 1.70 (p, 2H). mixture was cooled to room temperature, followed by addi- 

N-CBZ-2-aminoethanol (Compound #1154-104): To 0* tion of 2-amino-resorcinol (5.2 g, 42 mmole) and DBU (8.7 
C. cold of CH2CI2 solution (300 mL) of 2-aminoethanol g, 57,5 mmole). The resulting solution was stirred at room 
(10.8 g, 0.177 mole) and TEA (26.8 g; 0.265 mole) was 65 temperature overnight. The reaction mixture was concen- 
added slowly a CH2CI2 solution (50 mL) of benzyl chloro- trated to about volume, then poured into citric acid 
formate (33.2 g, 0.194 mole). After complete addition, the aqueous solution (7.5 g in 300 mL HgO) with vigorously 
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stirring. The precipitate was filtered off, washed with H2O, Compound 13c: Compound 11c (130 mg, 228 mmole) 

CH2CI2 then CH3CN, dried in a vacuum oven overnight, was treated with saturated NH3 in CH3OH (30 mL). After 2 

wc^cd 12.9 g, 67.8% yield of title compound. NMR days at room temperature the reaction mixture was concen- 

(DMSO-dg): 6 9.63 (s, 2H), 8,21 (s, IH), 8.0 (s, IH), 6.89 trated to dryness to give crude 12c. The crude 12c was then 

(t, IH, J=8,l Hz), 6.33 (d, 2H, J«8.1 Hz), 6.10 (t, IH, J«7.4 5 dissolved in pyridine (2 mL) containing TEA (89 mg, 1.2 

J?m ?'n^io nfe 4.12-4.30 (m, 3H), 2,30-2.40 (m, ^.^lole), followed by addition of DMT-Cl (115 mg, 340 

2H), 2.06 &2.03 (2s, 6H). ..... mmole). After 2 hrs at room temperature, the reaction 

.nH^^rf H Ti 1 r T T^M 1 r f ^^^^^^^ concentrated, partitioned between CH^Cl^ and 

nl™?e' ^ ^ ^ i^'lTr ^^turated NaHC03 aqueous solution, dried and purifi^ on 

proper protected ammo alcohol, Ph^P (1.5 eq) and diethyl 10 . \, ^tt r^TT/^TT ^1 .i. 

azodicarboxylate (DEAD, 1.5 eq), was added compound '^'^'r^U^^^^^^ t Cn30II/CIl2Cl,, hen 10% 

#1154-093 (1 eq). The resulting mixture was stirred at room F^S^^^i?^^^ ^f' y^^^^- 

temperature overnight. The reaction mixture was washed ^ ^MR (CDCl,): 8 7.20-7.50 (m, lOH), 6.86 (dd, 4H), 

with H2O, dried and purified by a silica gel (eluted with ^'^^ ^'^^'^ ^^)' ^-^^ ^'^-^ ^2)' ^-^^ (^^ 

CH3OH— CH^cy to yield compound lla-llc, normaUy is J'^-^ ^^)' ^''7.9 Hz), 4.52-4.60 (m, IH), 4.14 

they contaminated with Ph3P=0. However, the crude com- (q» IH). 4.0-4.10 (m, 2H), 3.77 & 3.79 (25, 6H), 3.32-3.45 

pound lla-llc was directly used for compounds 12a-12c i^^ 2H) 2.62-2.74 (m, 3H), 2,39 (s, 6H), 2.20-2.34 (m, IH). 

without further purification. Compound 13a (1154-175): A methanol solution (150 

Compound 12a-12c: Crude compound lla-llc was mL) of compound 12a (1154-111, 2.2 g, 4.3 mmole) was 

treated with saturated NH3 in CH3OH at room temperature 20 hydrogenated (H^ in a balloon) in the presence of 10% Pd/C 

for 1-3 days. After removal of solvent, the reaction mixture at room temperature overnight. The catalyst was filtered off, 

was purified by flash column chromatography on silica gel washed with CH3OH. The filtrate was concentrated to 

(eluded with CH30H~CH2Cl2), to yield compounds dryness. The crude unprotected 12a derivative was dissolved 

12a 12c. .r.. ..JTA ...^ . DMF/CH2CI, (10 mL/5 mL), and reacted with FMOC- 

n o^. ^^^ti^^^^ n^' (1.73 1 5.1 mmole) at room temperature for 1 hr. The 

f ' n'm^ ft S^^t ^'^'^'^ ^-^^'^ di^^t^^ with cici, (50 mL), con- 

12 mmole) and 1154-104 (3.0 g, 15.3 mole). H NMR , . . ^ 1 x ^ n j i_ j^-.- 

(DMSO-d,): 6 9.82 (bs, IH), 7.81 (bs, IH), 7.69 (bs, IH), T^^.^^^'^lTo^^ t "^T ll ^""^T^ ^^^^^^'^ 

7.25-7.36 (m, 5H), 6.82 (t, IH, 1=8.2 Hz), 6.60-6.80 (m, ^^^"^^ (^'^ ^'^ "^^^^^^ ^ 

2H), 6.46 (d,lH,J=8.2 Hz), 6.14 (t,lH,J=6.7 Hz), 5.22 (d, 30 worked up and purified by flash column 

IH), 5.06-5.13 (s+s, 3H), 4.20-4.25 (m, IH), 3.94-4,0 (m, chromatography, affordmg 2.87 g, 73% of compound 13a. 

2H), 3.79-3.81 (m, IH), 3.58-3.65 (m, 2H), 3.40-3.50 (m, calculated for C53H45N4O10 (M+H*) 901.345, 

2H), 1.98-2.20 (m, 2H). found 901.344. 

Compound 12b: Compound 12b (0.30 g; 57.6%) was Compound 13b: A ethanol solution (15 mL) of compound 

prepared from 1154-93 (0,5 g; 1.0 mmole) and compound 35 12b (1154-077) was hydrogenated (H2 in a balloon) in the 

1154-074 (0.31 g; 1.5 mmole). ^H NMR (CD3OD): 6 7.65 presence of 10% Pd/C at room temperature for 4 hrs. 

(s, IH), 7.24-7.36 (m, 5H), 6.79 (t, IH, J=8.6 Hz), 6.54 (d. Catalyst was filtered off. The filtrate was concentrated to 

IH, J=8.2 Hz), 6.36 (d, IH, J=8.2 Hz), 6.23 (t, IH), 5.06 (s, yield unprotected 12b derivative. Unprotected 12b deriva- 

2H), 4.38-4.45 (m, IH), 3.92-4.11 (m, 3H), 3.81 (q, 2H), live (125 mg, 0.32 mmole) was dissolved in DMF (2 mL) 

3.38 (t, 2H), 2.30-2.40 (m, IH), 2.10-2.22 (m, IH). 40 and reacted with FMOC-NHS (130 mg, 0.38 mmole) at 

Compound 11c: Compound 11c (86 mg; 37.7%) was room temperature for 1 hr. The reaction mixture was diluted 

prepared from Compound #1154-093 (200 mg, 0.4 mmole) with CH2CI2 (5 niL) containing pyridine (132 mg, 1.7 

and N,N-dimethylaminoethanol (43 mg, 0.48 mmole). ^H mmole), followed by addition of DMT-Cl (135 mg, 0.38 

NMR (CDCI3): a 10.9 (s,lH), 8.49 (s, IH), 7.97 (s, IH), 7.11 mmole). After 2 hrs, the reaction mixture was diluted with 

(t, IH, J-8.2 Hz), 6.77 (d, IH, J«8.2 Hz), 6.55 (d, IH, J-8.0 45 CH^Cl^ (10 mL), washed with H^O, dried and purified by 

Hz), 6.32 (dd, IH), 5.23-5.27 (m, IH), 4.40-4,46 (m, 2H), flash column chromatography to give compound 13b, 86 

4.33-^.37 (m, IH), 4.20 (t, 2H, J-5.5 Hz), 2.70-2.80 (m+t, mg, 29% yield. FAB HRMS calculated for C54H51N4O10 

3H), 2.30 (s, 6H), 2.20 (s, 3H), 2.10-2.20 (m+s, 3H). (M+H") 915.360, found 915.361. 
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HOCH2CH2OH, 

2,6-Di-t-Butyl- /^^^ 

4-methylpyridine J i 

■^"2Br AgOTf,RT, ' ' 
3 hr, 68% 
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r-,x-^\ 2.DMF,1 
1 



1. CH3SO2CI, 

CH2a2,TEA, /==N 

RT 

NaN3, 



#1154-112 



#1154-125 



AcO' 




NH. 



PI13P, CCI4- 



"XT' 



DBU, CH2CI2, 
RT, 2 days 



OAc 



#1154-133 



NH3/CH3OH 
RT, 3 days 



1. H2, 10% Pd/C (#1154-148) 




y 2. FMOC NHS, 

N==^ / ^ DMF, RT 

\r if 



3. DMT — CI 

Pyridine, RT 



OH 

#1154-138 
R= CH2OCH2CH2N3 



#1154-126 

HCi aq. - EtOH 
50* C. 



NH2 



.N3 



AcO' 





#1154-149 
R' = CH2OCH2CH2NHFMOC 



Compound #1154-108: 2-Ammo-m-cresol (2.1 g, 17.6 
mmole) was dissolved in trimethylorthoformate (15 mL), 
followed by addition of methanesuifonic acid (0.32 g, 3.4 
mmole, 20 molar %). The resulting solution was stirred at 
room temperature for 3 hrs. The reaction mixture was 
neutralized with TEA (0.4 g, 4.1 mmole), concentrated and 
purified on silica gel, eluted with 2% CH3OH/CH2CI2, 
affording 1,6 g, 72% yield. NMR (CDCI3): 6 8.10 (s, 
IH), 7.43 (d, IH, J-8.2 Hz), 7.31 (dd, IH, J-7.3 Hz, J-8.2 
Hz), 7.20 (d, IH, J-7.3 Hz), 2.68 (s, 3H). 

Compound #1154-112: Compound #1154-108 (4.3 g, 32 
mmole) was dissolved in CCI4 (50 mL), followed by addi- 
tion of N-bromosuccimide (6.3 g, 35 mmole) and AIBN 
(240 mg). The resulting mixture was refluxed for 2 hrs under 
N2. The reaction mixture was washed with HjO, dried, 
concentrated and purified by flash column chromatography 



on silica gel, eluted with 2% CH3OH/CH2CI2, affording 5.67 
g. of product, 82.7%. 'H NMR (CDQs): 6 8.16 (s, IH), 7.55 
(d, IH), 7.40-7.50 (m, 2H), 4.90 (s, 2H). 

55 Compound #1154-125: A CH2CI2 solution (20 mL) of 
Compound #1154-112 (0.28 g, 1.3 mmole), ethylene glycol 
(0.82 g, 13.2 mmole), silver triflate (0.5 g, 1.95 mmole) and 
2,6-di-t-butyl-4-methylpyridine (0.52 g, 2.6 mmole) was 
stirred at room temperature for 3 hrs. The precipitate was 

60 filtered off, washed with CH2CI2- The filtrate was washed 
with H2O, dried, concentrated and purified on silica gel, 
eluted with 3% CH3OH/CH2CI2, affording 0.17 g, 68% of 
product #1154-125. NMR (CDQs): 5 8.14 (s, IH), 7.55 
(d, IH), 7.36-7.40 (m, 2H), 4.97 (s, 2H), 3.70-3,85 (m, 4H), 

65 Compound #1154-126: Compound #1154-125 (170 mg, 
0.88 mmole) was dissolved in CH2CI2 (5 mL) containing 
TEA (0.27 g, 2.6 mmole) and treated with CHgSOja (0.15 
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g, 132 mmole). After 30 min at room temperature, the 
reaction mixture was washed with H2O, dried and concen- 
trated. The residue was dissolved in DMF (2 mL) followed 
by addition of sodium azide (86 mg, 1.3 mmole). The 
residtii^ solution was heated at reflux for 2 hrs. The reaction 5 
mixture was then partitioned between CRJCI2 and water. 
The organic phase was isolated, dried and purified on silica 
gel, eluted with 1% CH3OH/CH2CI2, to afford #1154-126, 
162 mg, 84%. NMR (CDCI3): 6 8.10 (s, IH), 4.99 (s, 
2H), 3.76 (t, 2H, J-4.8 Hz), 3.45 (t, 2H, J=4.6 Hz). 10 

Compound #1154-132: Compound #1154-126 (1.1 g, 5.0 
mmole) was treated with 3N HCl aqueous solution (10 mL) 
and ethanol (10 mL) at 50** C. for 1 hr. The reaction mixture 
was concentrated to dryness, azeotroped with CH3CN three 
times, used for next reaction without further purification. is 
NMR (DMSO-dg): 6 10.6 (bs), 6.85-7.20 (m, 3H), 
4,6(M.70 (m, 2H), 3.60-3.72 (m, 2H), 3.50-3.60 (m, 2H). 

Compound #1154-133: 5-Bromo-3',5'-diacetyl-2'- 
deoxyuridine (0.98 g, 2.5 mmole) and PhsP (0.8 g, 3.0 
mmole) were dissolved in CCiyCH^Cl^ (10 mL/10 mL), 20 
and were heated at reflux for 3 hrs. The reaction mixture was 
cooled to room temperature, followed by addition of #1154- 
132 (0.62 g, HCl salt, 2.5 mmole) and DBU (0.78 g, 5.1 
mmole). Hie resulting mixture was then stirred at room 
temperature for 2 days, washed with 5%, citric add aqueous 25 
solution, dried, concentrated and purified on silica gel. The 
isolated product contained Ph3P=0, without further puri- 
fication and used for next reaction. 



Compound #1154-138: The crude #1154-133 
(Theoretically 2.5 mmole) was treated with saturated NH3 in 
CH3OH at room temperature for 3 days, concentrated to 
dryness, and purified in silica gel, yield 0*28 g of 28% yield 
of product (2 steps). NMR (CD3OD): 6 7.56 (bs, IH), 
6.78-6.85 (m, 2H), 6.64-6.70 (m, IH), 6.19 (t, IH, J=6,4 
Hz), 4.54 (s, 2H), 4.35^.40 (m, IH), 3.88-3.94 (m, IH), 
3.60-3.85 (m, 4H), 3.62 (t, 2H), 2,22-2.35 (m, IH), 
2.10-2.20 (m IH). 

Compound #1154-148: A CH3OH solution (25 mL) of 
Compound #1154-138 (160 mg, 0.38 mmole) was hydroge- 
nated (H2 in a balloon) in the presence of 10% Pd/C at room 
temperature for 2 hrs. The catalyst was filtered off, washed 
-with CH3OH. The filtrate was concentrated to dryness to 
afford #1154-148. 'H NMR (CD3OD): 8 7.63 (s, IH), 
6.81-6.86 (m, 2H), 6.70-6.74 (m, IH), 6.22 (t, IH), 4.65 (s, 
2H), 4.35-4.40 (m, IH), 3.89-3.93 (m, IH), 3.68-3.84 (m, 
2H), 3.55 (t, 2H), 2.80-2.90 (m, 2H), 2.25-2.36 (m, IH), 
2.10-2.21 (m, IH). 

Compound #1154-149: A DMF solution (2 mL) of com- 
pound #1154-148 (170 mg, 0.43 mmole) was treated with 
FMOC-NHS (150 mg, 0.52 mmole). After 2 far reaction at 
room temperature the reaction mixture was diluted with 
CH2CI2 (10 mL) containing pyridine (0.7 g, 8.7 mmole), 
followed by addition of DMT-Cl (0.22 g, 0.65 mmole). After 
1 hr, the reaction was worked up and purified by flash 
column chromatography, to yield 232 mg, 58.2% of 5'-0- 
DMT-N-FMOC derivative #1154-149. FAB HRMS calcu- 
lated for C54H5iN40io (M+H-^) 915.360, found 915.359. 




(aSi(i-Pr)2)2 O 




#1154-135 
R= OCH2ai2NH2 




#1154-136 
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R' = OCH2CH2NH- 




OAc 



#1154-144 



25 



3'-5'-Diacetyl-0^-sulfonyl-2'-deoxyuridine (#1154-136): 
ACH2CI2 solution (10 mL) of 3'3'-diacetyl-2'-deoxyuridine 
(0.76 g, 2.4 mmole), 2~mesitylenesulfonyl chloride (1.0 g, 20 
4.8 mmole), TEA (1,23 121 mmole) and catalytic amount 
of DMAP(0.1 g) was stirred at room temperature overnight. 
The reaction mixture was diluted with CH2CI2, washed with 
5% citric acid aqueous solution, dried and purified on silica 
gel, eluted with CH2C[2^ 35% ethyl acetate in CH^Cl^, to 
give the title compound, 0.76 g in 63% yield. NMR 
(CDQa): 8 8.0 (d, IH, J-9 Hz), 6.99 (s, 10 2H), 6.13 (d, IH, 
J-9 Hz), 6.07 (t, IH, J-6.0 Hz), 5.15-5.20 (m, IH), 4.33 (s, 
3H), 2.70-2.85 (m+s, 7H), 2.73 (s, 3H), 2.09 (s, 3H), 
1.98-2,07 (m+s, 4H). 

3'-5'-0-(l,l,3,3-Tetraisopropyl.l,3-disiloxyanediyl)-9"- 
(aminoethoxy)phenoxazine (#1154-135): Compound 14 

(see preparation of 12a and 13a) was dissolved in DMF/ 
pyridine (2 mL/2 mL) followed by addition of 1,1,3,3- 
tetraisopropyl-dichlorodisilane (0.28 g, 0.91 mmole). The 
stirring was continued at room temperature for 3 hr. The 
reaction mixture was concentrated to dryness. The residue 
was used for next reaction without further purification. 

Compound #1154-137: A CH2CI2 solution (10 mL) of 
compound #1154-136 (0.3 g, 0.61 mmole), compound 
#1154-135 (crude, 0.61 nrniole), and DBU (0.46 g, 3.0 
mmole) was stirred at room temperature overnight. The 
reaction mixture was washed with 5% citric acid aqueous ^5 
solution, dried, and purified by flash column chromatogra- 
phy (sihca gel, CH3OH/CH2CI2), to give 0.21 g, 38% of 
product #1154-137, FAB LRMS calculated for 
C42H6iN60i3Si2 (M+H*) 912, found 913. 

Compound #1154-144. Compound #1154-137 (0.20 g, 
0.22 mmole) in THF 1 mL, was treated with 1 M BU4NF in 
CH2CI2 solution (0.87 mL, 0.87 mmole) at room tempera- 
ture for 30 min, then concentrated to dryness. The residue 
was ledissolved in CH2CI2 (5 mL) containing pyridine (173 55 
mg; 2.2 mmole), followed by addition of DMT-Cl (330 mg, 
1 mmole). After 2 hrs, the reaction mixture was washed with 
saturated NaHCOg aqueous solution, dried, and purified by 
flash column chromatography (silica- gel, CH30H/CH2Cy 
to yield 0,14 g, 65% of product #1154-144. FAB HRMS 



in Table I below. This sequence is complementary to a the 
12 base RNA sequence expressed by the large SV40 virus 
large T antigen gene, 5* GTA GTG AGG AGG 3' (SEQ ID 
NO. 1). In the tested oligonucleotides, which are shown in 
Table I, each linkage was a phosphorothioate linkage and all 
sugars were 2'-deoxyribose. 



30 



35 



40 



The tested oligonucleotides had the base sequences shown 
in Table I. In the Table I oligonucleotides, bases were 
designated as follows. 



Base 




abbreviation 


base 


C 


.5-Tnethylcytosine 


U 


5- (1-pTopy nyl)uiaca 


A 


adenine 


T 


thymine 


D 


5-(l-propynyl)c>'tosine 


Z 


structure (58) 


V 


structure (61) 


X 


structure (57) 


Y 


structure (60) 



50 



Structures of tricyclic bases described in this example and 
in following examples are as follows. 



H2N(CH2)3 



(55) 



60 



calculated 
973.364. 



for 



-'5l"S3^^6^14 



(M+H*) 973.362, found 



EXAMPLE 2 



Antisense inhibition of T-antigen expression. The tested 
DNA oligonucleotide analogs, had the base sequence ^own 
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-continued 

(60) 




Phenoxazine Tricyclic 
Cytidine 




The values obtained from the RNA hybrids and the 
intracellular IC50 values for T-antigen inhibition derived 
from microinjection analysis are shown below. In all cases, 
P-galactosidase inhibition was concurrently tested as an 
internal control and we saw no inhibition at 5 /M., the 
highest concentration tested. We measured inhibition of T 
antigen expression in CV-1 cells in tissue culture essentially 
as described (Wagner et al., "Science'' 260:1510-1513 
1993). We measured the AT^ C.) values relative to control 
ODNl essentially as described (Jones et al. "JOC" 58:2983 
1993), 



65 



6,028,183 

75 76 



TABLE I 



ODN 






AT 




# 


BASE 


ODN sequence 5' to 3' 




/O p \* 




1 


C control 


CCU-CCU-CAC-UAC 


SEQ ID NO. 2 


(65.0) 


0.5-1.0 


2 


D control 


DDU-DDU-DAD-UAD 


SEQ ID NO. 3 


+13.0 


0.25 


3 


(57) 


XCU-CCU-CAC-UAC 


SEQ ID NO. 4 


+2.5 


1.0 


4 


(57) 


CXU-CCU-CAC-UAC 


SEQ ID NO. 5 


+9.5 


0.05 


5 


(57) 


CCU-XCU-CAC-UAC 


SEQ ID NO. 6 


+6.5 


0.1 


6 


(57) 






+9.0 


0.05 


7 


(57) 


OCU-CrU-XAC-UAC 


SEQ ID NO. 8 


+6.5 


0.05-0.1 


8 


(57) 


CCU-<XU-CAX-UAC 


SEQ ID NO 9 


+5.0 


0.1 


9 


(57) 


CCU-CCU-CAC-UAX 


SEQ ID NO. 10 


+55 


2,0 


10 


mismatch 


CCX-CrU-UAC-UAC 


SEQ ID NO. 11 


-12.5 


>0J 


11 


T control 


CCT-CCT-CAC-TAC 


SEQ ID NO. 12 


-11.0 


>1.0 


12 


T-(57) 


CCT-CCT-XAC-TAC 


SEQ ID NO. 13 


-2.0 


0.25 


16 


D 


CCU-CCIJ-DAC-UAC 


SEQ ID NO. 14 


0 


0.5-1 


17 


(60) 


CCU>CCII-YAC-UAC 


SEQ ID NO. 15 


+0.5 


>0.5 


18 


(58) 


CCU-CCU-ZAC-UAC 


SEQ ID NO. 16 


+5.0 


0.3 


19 


(61) 


CCU-CCU-VAC-UAC 


SEQ ID NO. 17 


+1.0 


0.35 



*AT„, Relative to control ODNl 



The results above show that the presence of an invention 
base elicits potent and specific antisense inhibition of target 
gene expression (compare ODNl vs. 0DN4 and 0DN6). 
Incorporation of one invention base at an internal C position ^ 
resulted in a potency enhancement which exceeds that 
obtained from the substitution of seven 5-(l-propynyl) 
cytosine bases for 5-methylcytosine (compare 0DN2 and 
0DN4). The 0DN12 results showed that oligonucleotides 30 
containing limited base substitutions and only a single 
invention base may have significant antisense potency. 
Phosphorothioate linked oligodeoxynucleotides containing 
propynyl-substituted pyrimidine bases were previously the 
most potent class of antisense agents that workers had 
described. 

Oligonucleotides containing a base of structure (3) at an 
internal position in the oligonucleotide where was (a) the 40 
(S) isomer of — O— CH2~C*H(CH3>~NH2 or (b) the (R) 
isomer of — O— CHj— .C*H(CH3)— NH2 were prepared 
and tested for binding affinity in a similar manner. Both 
oligonucleotides had an increased binding affinity compared 
to the control oligonucleotide. 



EXAMPLES 

50 

Antisense inhibition of p27*^^ expression. The sequence 
of 0DN13 was 5' UGGCUCUCCUGCGCC 3^ (SEQ ID NO. 
18) and it contained phosphorothioate linkages, all sugars 
were 2 -deoxyribose, all G positions contained guanine, all 55 
U positions contained 5-(l-propynyl)uracil and all C posi- 
tions contained 5-(l-propynyl)-cytosine. 0DN14 had the 
same sequence and base composition as 0DN13, except that 
all C positions contained 5-methylcytosine instead of 5-(l- 
propynyl)cytosine and thymidine instead of 5-(l-propynyl) 60 
uracil. 0DN15 had the same sequence and base composition 
as 0DN14, except that the 9th C position from the 5' end 
contained a (57) base instead of 5-methylcytosine. Each 
ODN was tested for its capacity to inhibit p27 gene expres- 
sion in CV-1 cells in cell culture using cationic lipid to 65 
deliver the oligonucleotides into the cells, essentisdly as 
described (Coats "Science" 272:877-^880, 1996). 



ODN 


IC5o(nM) 


13 


10 


14 


>20 


15 


<5 



The results showed that 0DN15 containing one (57) base is 
more potent than the all-propyne derivative 0DN13. 
Previously, phosphorothioate-liidced oligonucleotides con- 
taining 5-l(propynyl)-modified pyrimidine bases were the 
among the most potent reported class of antisense agents. 

0DN13 was tested in rats and, in a 10 day toxicological 
evaluation, it was found that the MTD (maximum tolerated 
dose) was 0.6 mg/kg/d i.v. 0DN15 was tested in the same 
manner with no toxicity observed at a dose of 6.0 mg/kg/d 
i.v. 

EXAMPLE 4 

Increased oligonucleotide binding affinity and specificity. 
We made a 10-mer oligonucleotide, ODN22, having the 
base sequence, 5' TCTCCCTCTC 3^ (SEQ ID NO. 19). 
ODN22 contained only phosphorothioate linkages, all sug- 
ars were 2'-deoxyribose, bases designated T were thymine, 
bases designated C were 5-methylcytosine. ODN23 was the 
same as ODN22, except that the 5th base position from the 
5' end contained 5-(l-propynyl)cytosine (base designated 
"D" in Table IV). ODN24 was the same as ODN22, except 
that the 5th base position from the 5' end contained a 
structure (60) base. ODN25 was the same as ODN22, except 
that the 5th base position from the 5' end contained a 
structure (55) base. ODN26 was the same as ODN22, except 
that the 5th base position from the 5' end contained a 
strucUire (57) base. ODN27 was the same as ODN22, except 
that the 5th base position from the 5' end contained a 
structure (59) base. ODN28 was the same as ODN22, except 
that the 5th base position from the 5' end contained a 
structure (58) base. ODN29 was the same as ODN22, except 
that the 5th base position from the 5* end contained a 
structure (61) base. ODN30 was the same as ODN22, except 
that the 5lh base position from the 5' end contained a 
structure (65) base. 

We measured the T„ C C.) of ODN22-ODN30 using 4 
different oligonucleotides: A complementary RNA oligo- 
nucleotide (0DN31), an RNA oligonucleotide (ODN32) 
having adenine at the 11th position from its 5' end, i.e., a 
single A: test base mismatch, a complementary DNA oligo- 
nucleotide (ODN33), and a DNA oligonucleotide (ODN34) 



